CHIMPANZEE ADENOVIRUS VECTORS

This invention was supported by the National
Institutes of Health Grant No. DK47757. The United States
government has rights in this invention.

Field of the Invention

The present invention relates to the field of 
vectors useful in somatic gene therapy and the production 
and use thereof, and also to the field of vaccines. 

Background of the Invention 

I. Gene Therapy

Gene therapy is an approach to treating
disease, generally human disease, that is based on the
modification of gene expression in cells of the patient.
It has become apparent over the last decade that the
single most outstanding barrier to the success of gene
therapy as a strategy for treating inherited diseases,
cancer, and other genetic dysfunctions is the development
of useful gene transfer vehicles.

Eukaryotic viruses have been employed as
vehicles for somatic gene therapy. Among the viral
vectors that have been cited frequently in gene therapy
research are adenoviruses. Adenoviruses are eukaryotic
DNA viruses that can be modified to efficiently deliver a
therapeutic or reporter transgene to a variety of cell
types. Human adenoviruses are composed of a linear,
approximately 36 kb double-stranded DNA genome, which is
divided into 100 map units (m.u.), each of which is 360
bp in length. The DNA contains short inverted terminal
repeats (ITR) at each end of the genome that are required
for viral DNA replication. The gene products are
organized into early (E1 through E4) and late (L1 through
L5) regions, based on expression before or after the
initiation of viral DNA synthesis [see, e.g., Horwitz,
Virology, 2d edit., ed. B. N. Fields, Raven Press, Ltd.,
New York (1990)].
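
The map-unit arithmetic stated above (an approximately 36 kb
genome divided into 100 map units of 360 bp each) can be
sketched as follows. This is an illustrative sketch only; the
function names and the rounded genome length of 36,000 bp are
assumptions made for the example and are not part of the
specification.

```python
def bp_per_map_unit(genome_length_bp: int, total_map_units: int = 100) -> float:
    """Return the number of base pairs spanned by one map unit."""
    return genome_length_bp / total_map_units


def position_to_map_units(position_bp: int, genome_length_bp: int) -> float:
    """Express a nucleotide position as a map-unit coordinate (0 to 100)."""
    if not 0 <= position_bp <= genome_length_bp:
        raise ValueError("position lies outside the genome")
    return 100.0 * position_bp / genome_length_bp


# Assumed round figure for the "approximately 36 kb" genome in the text.
HUMAN_AD_GENOME_BP = 36_000

print(bp_per_map_unit(HUMAN_AD_GENOME_BP))                # bp in one map unit
print(position_to_map_units(18_000, HUMAN_AD_GENOME_BP))  # genome midpoint
```

Because a map unit is simply one percent of the genome length,
the same functions apply to any adenovirus genome once its
length is known.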

Recombinant adenoviruses types 2 and 5
(Ad2 and Ad5, respectively), which cause respiratory
disease in humans, are currently being developed for gene
therapy. Both Ad2 and Ad5 belong to a subclass of
adenovirus and are not associated with human
malignancies.

Recombinant adenoviruses are capable of
providing extremely high levels of transgene delivery to
virtually all cell types, regardless of the mitotic
state. High titers (10^^ plaque forming units/ml) of
recombinant virus can be easily generated in an
adenovirus-transformed, human embryonic kidney cell line
293 [ATCC CRL1573]. The 293 cell line contains a
functional adenovirus E1a gene which provides a
transacting E1a protein. The recombinant virus can be
cryo-stored for extended periods without appreciable
losses.

The efficacy of this system in delivering
a therapeutic transgene in vivo that complements a
genetic imbalance has been demonstrated in animal models
of various disorders [K. F. Kozarsky et al, Somatic Cell
Mol. Genet., 19:449-458 (1993) ("Kozarsky et al I"); K.
F. Kozarsky et al, J. Biol. Chem., 269:13695-13702 (1994)
("Kozarsky et al II"); Y. Watanabe, Atherosclerosis,
16:261-268 (1986); K. Tanzawa et al, FEBS Letters,
118(1):81-84 (1980); J. L. Goldstein et al, New Engl. J.
Med., 309:288-296 (1983); S. Ishibashi et al, J. Clin.
Invest., 92:883-893 (1993); and S. Ishibashi et al, J.
Clin. Invest., 93:1885-1893 (1994)]. Indeed, a
recombinant replication defective adenovirus encoding a
cDNA for the cystic fibrosis transmembrane regulator
(CFTR) has been approved for use in at least two human CF
clinical trials [see, e.g., J. Wilson, Nature,
365:691-692 (Oct. 21, 1993)]. The use of adenovirus
vectors in the transduction of genes into hepatocytes in
vivo has previously been demonstrated in rodents and
rabbits [see, e.g., Kozarsky II, cited above, and S.
Ishibashi et al, J. Clin. Invest., 92:883-893 (1993)].
Further support for the safety of recombinant
adenoviruses for gene therapy comes from the extensive
experience with live adenovirus vaccines in human
populations.

However, many humans have pre-existing
immunity to human adenoviruses as a result of previous
natural exposure, and this immunity is a major obstacle
to the use of recombinant human adenoviruses for gene
therapy protocols.

II. Vaccines 

Replication competent, recombinant
adenoviruses (Ad) containing a variety of inserted genes
have been used as vaccine compositions with some success
[see, e.g., Davis, U.S. Patent No. 4,920,309]. Others
have described the insertion of a foreign gene into a
live [L. Prevec, J. Infect. Dis., 161:27-30 (1990)] and a
replication-defective adenovirus for putative use as a
vaccine [see, e.g., T. Ragot et al, J. Gen. Virol.,
74:501-507 (1993); M. Eliot et al, J. Gen. Virol.,
71:2425-2431 (1990); and S. C. Jacobs et al, J. Virol.,
66:2086-2095 (1992)]. Jacobs et al, cited above,
describe a recombinant E1-deleted, E3-intact Ad
containing encephalitis virus protein NS1 under the
control of a heterologous cytomegalovirus (CMV) promoter.
When mice were immunized with the recombinant Ad vaccines
and challenged with virus, Jacobs et al obtained partial
protection (at most a 75% protection) for an average
survival of 15 days. Eliot et al, cited above, describe
a recombinant E1-deleted, partially E3-deleted Ad with
pseudorabies glycoprotein 50 inserted into the E1
deletion site under the control of a homologous Ad
promoter. In rabbits and mice, after immunization and
challenge, only partial protection was obtained (i.e.,
about one-third). Ragot et al, cited above, describe a
recombinant E1-deleted, partially E3-deleted Ad with
Epstein Barr virus glycoprotein gp340/220 inserted into
the E1 deletion site under the control of a homologous Ad
promoter. In marmosets (tamarins) after three high dose
(5X10^ pfu, 1X10^^ pfu and 2X10^^ pfu), intramuscular
immunizations and viral challenge, full protection was
obtained.

For certain highly infectious diseases,
there is a demand for an effective vaccine. Desirably, a
vaccine should be effective at a low dosage to control
the occurrence of side effects or to enable sufficient
amounts of vaccine to be introduced into the animal or
human.

There exists a need in the gene therapy art for
the development of additional adenovirus vector
constructs that do not stimulate immediate immune
responses which quickly eliminate the recombinant virus
and the therapeutic transgene from the patient. There
also exists a need in the vaccine art for new vaccine
carriers, which are safe and effective in humans and
other mammals.



Summary of the Invention

The present invention meets the need in the art
by providing adenovirus nucleotide sequences of
chimpanzee origin, a variety of novel vectors, and cell
lines expressing chimpanzee adenovirus genes.

In one aspect the invention provides the
nucleotide sequence of a chimpanzee C1 adenovirus. See
SEQ ID NO: 1.

In another aspect the invention provides the
nucleotide sequence of a chimpanzee C68 adenovirus. See
SEQ ID NO: 2.

In a further aspect, the invention provides a
recombinant adenovirus comprising the DNA sequence of a
chimpanzee adenovirus and a selected heterologous gene
operatively linked to regulatory sequences directing its
expression. The recombinant virus is capable of
infecting a mammalian, preferably a human, cell and
capable of expressing the heterologous transgene product
in the cell. In this vector, the native chimpanzee E1
gene, and/or E3 gene, and/or E4 gene may be deleted. A
heterologous gene may be inserted into any of these sites
of gene deletion. The heterologous transgene may encode
a normal or therapeutic gene which, upon expression,
replaces or modifies an inherited or acquired genetic
defect. The heterologous gene may encode an antigen
against which a primed immune response is desired (i.e.,
a vaccine).

In another aspect, the invention provides a
mammalian cell infected with the viral vector described
above.

In still a further aspect of this invention, a
novel mammalian cell line is provided which expresses a
chimpanzee adenovirus gene or functional fragment
thereof.

In still a further aspect, the invention
provides a method for delivering a transgene into a
mammalian cell comprising the step of introducing into
the cell an effective amount of a recombinant virus
described above.

Another aspect of this invention is a method
for delivering to a mammalian patient having a disorder
related to an inherited or acquired genetic defect a
desired transgene. The method comprises the step of
administering to the patient by an appropriate route an
effective amount of an above-described recombinant
chimpanzee adenovirus containing a normal or therapeutic
transgene, wherein the transgene product is expressed in
vivo.

Still another aspect of this invention provides
a method for eliciting an immune response in a mammalian
host to protect against an infective agent. The method
comprises the step of administering to the host an
effective amount of a recombinant chimpanzee adenovirus
comprising a heterologous gene that encodes an antigen
from the infecting organism against which the immune
response is targeted.

Other aspects and advantages of the present 
invention are described further in the following detailed 
description of the preferred embodiments thereof. 



Brief Description of the Drawings

Fig. 1A is a diagrammatic bar graph
illustrating the structure of the chimpanzee adenovirus
C1 (also referred to as C-1) and the location of the
adenovirus genes thereon by nucleotide position and by
map unit numbers appearing under the bar graph. The
locations of the late genes (L-1 through L-5) are
represented by arrows below the graph with molecular
weight indications above the arrows and nucleotide
positions below the arrows. The location of the E2a
region early TATA box and transcriptional start site was
not determined. The E2a region is estimated to begin
approximately at nucleotide 27,100. The position of the
translation initiation codon for the E2a encoded DNA
binding protein is indicated by an asterisk.

Fig. 1B is a line graph showing the correlation
between map units and nucleotide (base) pairs of the
sequence of C1 [SEQ ID NO: 1].

Fig. 1C is a bar graph illustrating the various
BamHI clones obtained for the C1 Ad, indicating
nucleotide numbers, fragment size in nucleotides, clone
numbers, and fragment boundaries in nucleotides.

Fig. 2 is a tabular comparison of C1 and C68
predicted amino acid sequences examined for homology to
previously described adenoviral protein sequences, Ad4,
Ad5, Ad7, Ad12, and Ad40. Symbol "a" indicates that
comparison of fragments of different size resulted in an
underestimate of homology. Symbol "b" indicates a 95%
identity from Ad-4 aa 1-95. A possible mistake in
sequence apparently resulted in a frameshift and
premature termination in this comparison. Symbol "c"
indicates that Ad-5 has 2 small ORFs in this region
encoding proteins of 64 and 67 residues with
approximately 50% amino acid identity with, respectively,
the amino and carboxy halves of the chimp Ad homologs.
Symbol "d" indicates that Ad-3 and Ad-7 fragments were
not sequenced for this protein. Symbol "e" indicates
that Ad-35 and Ad-4 were not sequenced for this protein.
Symbol "f" indicates that the reported sequence for Ad-7
pVIII is 197 aa, and the homology begins at aa 30 of the
chimp Ad sequences. The homology between the chimp Ads
and Ad-7 for the 197 aa region is 98% for C-1 and 90% for
C-68.

Fig. 3A is a diagrammatic bar graph
illustrating the structure of the chimpanzee adenovirus
C68 and the location of the adenovirus genes thereon by
nucleotide position and by map unit numbers appearing
under the bar graph. The locations of the late genes are
represented as described for Fig. 1A. The location of
the E2a region early TATA box and transcriptional start
site was not determined. The E2a region is estimated to
begin approximately at nucleotide 26,800. The position
of the translation initiation codon for the E2a encoded
DNA binding protein is indicated by an asterisk.
Although the entire genome of C68 has been cloned,
certain of the fragments in Fig. 3 have been individually
cloned (white bars) or not cloned (shaded bars).

Fig. 3B is a line graph showing the correlation
between map units and nucleotide (base) pairs of the
sequence of C68 [SEQ ID NO: 2]. White and shaded boxes
are defined as in Fig. 3A.

Fig. 3C is a bar graph illustrating the various
Pst fragments obtained for the C68 Ad, indicating
nucleotide numbers, fragment sizes in nucleotides, clone
numbers and fragment boundaries in nucleotides. White
and shaded boxes are defined as in Fig. 3A.

Fig. 3D is a bar diagram illustrating BamHI
fragments of the C68 genome indicating nucleotide
numbers, fragment size in nucleotides, clone numbers, and
fragment boundaries in nucleotides. White and shaded
boxes are defined as in Fig. 3A.

Fig. 3E is a bar diagram illustrating the
HindIII-B fragment and its nucleotide boundaries and
size. White and shaded boxes are defined as in Fig. 3A.

Fig. 4A is a more detailed schematic drawing of
pC68-CMV-LacZ.

Fig. 4B is a schematic representation of
pBS-Notx2.

Fig. 5A is a schematic drawing of plasmid 
pGPGK. The arrow indicates the direction of the murine 
PGK promoter. Restriction sites and marker genes are 
conventionally labeled. 

Fig. 5B is a schematic drawing of plasmid
pNEB-C68BamE. This plasmid contains fragments of the LacZ
gene (small arrow) flanking either side of the bar
indicating the C68 Ad BamE fragment. The large arrow
illustrates the Amp^R gene. Restriction sites and marker
genes are conventionally labeled.

Fig. 5C is a schematic drawing of plasmid
pGPGK-C68BamE in which the BamE fragment from
pNEB-C68BamE has been cloned downstream from the PGK
promoter of pGPGK.

Fig. 5D is a representation of the PCR
amplification of the C68 sequence from pNEB-C68BamE,
illustrating the use of primers to introduce a KpnI site
just upstream of the C68 E1 region translation initiation
codon at nucleotide 576 of the C68 genomic DNA and reduce
the sequence distance between the promoter and C68 coding
sequence. Location of the primers is indicated.

Fig. 5E is a schematic drawing of plasmid
pGPGK-C68E1-ATG, in which the ATG translational start
codon was moved closer to the PGK promoter.

Fig. 5F is a schematic drawing of plasmid
pBS-C68BamF, in which the BamF fragment was cloned into
the BamHI site of pGPGK-C68E1-ATG to generate pGPGK-C68E1
(Fig. 5G).

Fig. 5G is a schematic drawing of plasmid
pGPGK-C68E1, containing the complete chimpanzee C68 Ad E1
region under the control of the murine PGK promoter.

Fig. 6A is a schematic drawing of plasmid
pGPGK, a duplication of Fig. 5A for purposes of
explaining construction of the C1 Ad E1 expression
plasmid.

Fig. 6B illustrates the isolation of the 5' end
of the C1 E1 region as a 1.9 kb SnaBI-XbaI fragment.

Fig. 6C illustrates the use of primers to
introduce by PCR amplification a KpnI site just upstream
of the C1 E1 region translation initiation codon E1-ATG
at nucleotide 577 of the C1 genomic DNA.

Fig. 6D is a schematic drawing of plasmid
pGPGK-C1 mu1.3-6.6 (7.4 kb).

Fig. 6E is a schematic drawing of plasmid
pGPGK-C1-E1ATG.

Fig. 6F is a schematic drawing of plasmid
pBS-C1BamI.

Fig. 6G is a schematic drawing of plasmid
pGPGK-C1E1, containing the complete chimpanzee C1 Ad E1
region under the control of the murine PGK promoter.

Fig. 7A is a schematic drawing of plasmid
pSP72-Pac with indicated restriction endonuclease enzyme
cleavage sites.

Fig. 7B is a schematic drawing of plasmid
pNEB-C1-BamG.

Fig. 7C is a schematic drawing of plasmid
pSP-C1-mu0-1.3.

Fig. 7D is a schematic drawing of plasmid
pCMV-β.

Fig. 7E is a schematic drawing of plasmid
pSP-C1-mu0-1.

Fig. 7F is a schematic drawing of plasmid
pGEM-3Z.

Fig. 7G is a schematic drawing of plasmid
pBS-C1-BamI.

Fig. 7H is a schematic drawing of plasmid
pGEM-C1-mu9-10.

Fig. 7I is a schematic drawing of plasmid
pBS-C1-BamE.

Fig. 7J is a schematic drawing of plasmid
pGEM-C1-mu9-17.

Fig. 7K is a schematic drawing of plasmid
pC1-CMV-LacZ, illustrating C1 Ad mu 0 to 1.3, followed by
the CMV promoter, a splice donor/splice acceptor sequence
(SD/SA), the LacZ gene, a SV40 poly A sequence and C1 Ad
mu 9-17, and additional plasmid sequence. The plasmid
also contains an ori and Amp^R sequence.

Fig. 8A is a schematic drawing of pSP72-Pac
with indicated restriction endonuclease enzyme cleavage
sites.

Fig. 8B is a schematic drawing of pNEB-C68-

Fig. 8C is a schematic drawing of pSP-C68-mu

Fig. 8D is a schematic drawing of pCMV-β.

Fig. 8E is a schematic drawing of pSP-C68-mu
0-1.3-CMV-β.

Fig. 8F is a schematic drawing of pGEM-3Z.

Fig. 8G is a schematic drawing of pBS-C68-BamF.

Fig. 8H is a schematic drawing of
pGEM-C68-mu9-10.

Fig. 8I is a schematic drawing of pBS-C68-BamB.

Fig. 8J is a schematic drawing of
pGEM-C68-mu9-16.7.

Fig. 8K is a schematic drawing of
pC68-CMV-LacZ, illustrating C68 Ad mu 0 to 1.3, followed
by the CMV promoter, an SD/SA, the LacZ gene, a SV40 poly
A sequence and C68 Ad mu 9-16.7, and additional plasmid
sequence. The plasmid also contains an ori and an Amp^R
sequence.

Fig. 9A is a schematic drawing of pEGFP-1
(Clontech, Palo Alto, CA).

Fig. 9B is a schematic drawing of a Not-I
synthetic linker (New England Biolabs).

Fig. 9C is a schematic drawing of pEGFP-Notx2.

Fig. 9D is a schematic drawing of pC1-CMV-LacZ
(from Fig. 7K).

Fig. 9E is a schematic drawing of pC68-CMV-LacZ
(from Fig. 8K).

Fig. 9F is a schematic drawing of pC1-CMV-GFP,
in which the GFP coding region replaces the LacZ gene of
pC1-CMV-LacZ.

Fig. 9G is a schematic drawing of pC68-CMV-GFP,
in which the GFP coding region replaces the LacZ gene of
pC68-CMV-LacZ.

Fig. 10A is a schematic drawing of pC68-CMV-GFP
as discussed in Fig. 9G.

Fig. 10B is a schematic drawing of the C68
genome.

Fig. 10C is a schematic drawing of the
C68-SspI-A fragment, which is 35,199 nucleotides.

Fig. 10D is a schematic drawing of the
C68-CMV-GFP genome, which is formed by homologous
recombination between the C68 mu 9-16.7 sequence in
pC68-CMV-GFP and the homologous sequence in the
C68-SspI-A fragment.

Fig. 11A is a schematic drawing of
pNEB-C1-BamG.

Fig. 11B is a schematic drawing of the C1
genome.

Fig. 11C is a schematic drawing of
pNEB-C1-AscI-B.

Fig. 11D is a schematic drawing of a Not-I
synthetic linker (New England Biolabs).

Fig. 11E is a schematic drawing of
pNEB-C1-AscI-B-NotI.

Fig. 11F is a schematic drawing of the C1
genome.

Fig. 11G is a schematic drawing of the AscI-A
fragment of the C1 genome.

Fig. 11H is a schematic drawing of the C1
genome engineered to have a unique NotI site replacing
the Spe-I site in the E1B 21K protein coding region.

Detailed Description of the Invention 

The present invention provides novel adenovirus
vectors and packaging cell lines to produce those vectors
for use in the in vitro production of recombinant
proteins or fragments or other reagents, and for use in
the treatment of inherited or acquired genetic disorders
and abnormalities in humans and other mammals. The
present invention also provides novel vaccine
compositions which comprise those vectors, the vectors
comprising an inserted heterologous gene encoding an
antigen from an infectious agent.

The methods of the invention involve delivering
one or more selected heterologous gene(s) to a mammalian
patient by administering a vector of the invention.
Because the various vector constructs are derived from
chimpanzee rather than from human adenoviruses, the
immune system of the patient is not primed to respond
immediately to the vector as a foreign antigen. A
similar response would be expected where the patient was
any mammal other than chimpanzee.

Use of the compositions of this invention thus
permits a more stable expression of the selected
transgene when administered to a non-chimpanzee,
preferably human patient. Use of the compositions of
this invention for vaccination permits presentation of a
selected antigen for the elicitation of protective immune
responses. The recombinant chimpanzee adenoviruses of
this invention may also be used for producing
heterologous gene products in vitro.

I. Cloning of Chimpanzee Adenovirus Sequences

Chimpanzee adenovirus, strain Bertha or C1
[ATCC Accession No. VR-20] and chimpanzee adenovirus,
strain Pan-9 or CV68 [ATCC Accession No. VR-594] were
obtained from the American Type Culture Collection, 12301
Parklawn Drive, Rockville, MD. For convenience, the
virus CV68 is referred to throughout this specification
as "C68". The viruses were originally isolated from
feces [C1, Rowe et al, Proc. Soc. Exp. Med., 91:260
(1956)] or mesenteric lymph node [C68, Basnight et al,
Am. J. Epidemiol., 94:166 (1971)] of infected
chimpanzees.

Little is known about these viruses.
However, limited restriction and immunological analyses
have been published. For example, C1 was shown to be
most similar to Subgroup B human adenoviruses, but it was
not neutralized by heterologous sera, and no
hemagglutination inhibition was observed [Wigand et al,
Intervirology, 30:1 (1989)]. Restriction analysis
demonstrated that C68 was most similar to human Ad4
serotype (Subgroup E), but only 1 in 16 enzymes tested
did not distinguish C68 and Ad4 [Kitchingman, Gene,
20:205 (1982)].

Both chimpanzee adenoviruses grow well in
human cells and were propagated in human embryonic kidney
293 cells. As described in detail in Examples 1 and 2
below, genomic DNA was isolated from purified virus
stocks and digested with a panel of restriction enzymes
and the restriction fragments cloned and sequenced. The
genomic nucleotide sequence of C1 adenovirus is set out
in SEQ ID NO: 1. The genomic nucleotide sequence of C68
adenovirus is set out in SEQ ID NO: 2.

Preliminary analysis of the sequence
homology between C1, C68 and human adenoviruses was in
agreement with the previously mentioned immunologic or
restriction enzyme analysis. By reference to Figs. 1A-1C
and 3A to 3D, it is shown that the putative E1 region of
C1 occurs between about nucleotides 480 and about 3958;
and of C68 between about nucleotides 480 and about 3956.

Other gene regions of C1 are identified by
homology of the C1 sequence of SEQ ID NO: 1 to the known
sequences of human adenoviruses Ad3, Ad5 and Ad7.
Similarly, other gene regions of C68 are identified by
homology of the C68 sequence of SEQ ID NO: 2 to the known
sequence of human adenovirus Ad4 and Ad5. The genomic
regions encoding early gene functions for E2a, E2b, E3,
E4, as well as the regions of C1 and C68 encoding late
adenoviral gene products, are identified in Tables I and
II below.

Table I

C1 Chimpanzee Genome

Gene    Nucleotides    Map Units    Size (nucl./mu)
E1A     480-1540       1.4-4.3      1060/3.0
E1B     1566-3958      4.4-11.1     2392/6.7
E2A     23665-22065    66.6-62.1    1600/4.5
E2B     10379-3959     29.2-11.1    6420/18.1
E3      27181-31375    76.5-88.3    4194/11.8
E4      35228-32535    99.2-91.6    2693/7.6
L1      10893-13864    30.7-39.0    2971/8.4
L2      13925-17591    39.2-49.5    3666/10.3
L3      17641-22083    49.7-62.2    4442/12.5
L4      23697-27813    66.7-78.3    4116/11.6
L5      31556-32551    88.8-91.6    995/2.8
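
The relationship between the "Nucleotides" and "Size" columns
of Table I can be checked with a short script. The following is
a sketch only: the dictionary layout and the helper name are
illustrative assumptions, and the reading of a descending
coordinate pair as a region listed in the opposite orientation
is an interpretation, not a statement from the specification.

```python
# Table I entries: gene -> (first coordinate, second coordinate, listed size in nucleotides)
TABLE_I = {
    "E1A": (480, 1540, 1060),
    "E1B": (1566, 3958, 2392),
    "E2A": (23665, 22065, 1600),
    "E2B": (10379, 3959, 6420),
    "E3": (27181, 31375, 4194),
    "E4": (35228, 32535, 2693),
    "L1": (10893, 13864, 2971),
    "L2": (13925, 17591, 3666),
    "L3": (17641, 22083, 4442),
    "L4": (23697, 27813, 4116),
    "L5": (31556, 32551, 995),
}


def span(first: int, second: int) -> int:
    """Distance in nucleotides between the two listed coordinates."""
    return abs(second - first)


# Genes whose listed size disagrees with the coordinate difference.
mismatches = {
    gene: (span(first, second), size)
    for gene, (first, second, size) in TABLE_I.items()
    if span(first, second) != size
}

# Genes whose coordinates are listed from high to low (assumed here to
# indicate the opposite orientation on the genome).
listed_descending = [g for g, (first, second, _) in TABLE_I.items() if first > second]

print("size mismatches:", mismatches)
print("listed high-to-low:", listed_descending)
```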



Table II

C68 Chimpanzee Genome

Gene    Nucleotides    Map Units    Size (nucl./mu)
E1A     480-1521       1.3-4.2      1041/2.9
E1B     1560-3956      4.3-10.8     2396/6.6
E2A     23370-21787    64.0-59.7    1583/4.3
E2B     10346-3957     28.3-10.8    6389/17.5
E3      26806-31877    73.4-87.3    5071/13.9
E4      36193-33486    99.1-91.7    2707/7.4
L1      10823-13817    29.6-37.8    2994/8.2
L2      13884-17431    38.0-47.7    3547/9.7
L3      17480-21804    47.9-59.7    4324/11.8
L4      23399-27439    64.1-75.1    4040/11.1
L5      32134-33502    88.0-91.7    1368/3.7
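
Table II pairs each nucleotide coordinate with a map-unit
coordinate, which is the same correlation plotted in Fig. 3B.
As an illustration, the number of base pairs per map unit
implied by the table can be estimated by fitting a line through
the origin to those pairs. This is a sketch under stated
assumptions: the least-squares fit and the function name are
choices made for the example, and the result is an estimate
from rounded table values, not a figure given in the
specification.

```python
# (nucleotide coordinate, map-unit coordinate) pairs read from Table II.
TABLE_II_POINTS = [
    (480, 1.3), (1521, 4.2),        # E1A
    (1560, 4.3), (3956, 10.8),      # E1B
    (23370, 64.0), (21787, 59.7),   # E2A
    (10346, 28.3), (3957, 10.8),    # E2B
    (26806, 73.4), (31877, 87.3),   # E3
    (36193, 99.1), (33486, 91.7),   # E4
    (10823, 29.6), (13817, 37.8),   # L1
    (13884, 38.0), (17431, 47.7),   # L2
    (17480, 47.9), (21804, 59.7),   # L3
    (23399, 64.1), (27439, 75.1),   # L4
    (32134, 88.0), (33502, 91.7),   # L5
]


def fit_bp_per_map_unit(points):
    """Least-squares slope of bp versus map units for a line through the origin."""
    numerator = sum(bp * mu for bp, mu in points)
    denominator = sum(mu * mu for _, mu in points)
    return numerator / denominator


slope = fit_bp_per_map_unit(TABLE_II_POINTS)
print(f"estimated bp per map unit: {slope:.1f}")
print(f"implied genome length (100 map units): {100 * slope:.0f} bp")
```

The fit is dominated by the coordinates far from the left end
of the genome, where rounding of the map-unit values to one
decimal place has the least relative effect.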



Our preliminary experiments demonstrated
that human antisera do not neutralize the chimpanzee
adenoviruses in neutralizing antibody assays (see, e.g.,
International patent application PCT95/03035), thus
indicating the desirability of vectors prepared from
these sequences for gene therapy in humans. As further
described in the examples, plasmids establishing
chimpanzee adenovirus E1-expressing cell lines and
recombinant E1-deleted adenoviruses expressing a
transgene are prepared.

The viral sequences used in the vectors
and cell lines described below may be generated by using
the teachings and references contained herein, coupled
with standard recombinant molecular cloning techniques
known and practiced by those skilled in the art.

II. E1-Expressing Complementation Cell Lines

To generate recombinant chimpanzee
adenoviruses (Ad) deleted in any of the genes described
above, the function of the deleted gene region, if
essential to the replication and infectivity of the
virus, must be supplied to the recombinant virus by a
helper virus or cell line, i.e., a complementation or
packaging cell line. For example, to generate a
replication-defective chimpanzee adenovirus vector, a
cell line is needed which expresses the E1 gene products
of the chimpanzee adenovirus. The protocol for the
generation of the cell lines expressing the chimpanzee E1
gene products (Examples 3 and 4) is followed to generate
a cell line which expresses any selected chimpanzee
adenovirus gene.

Conventional assays were not useful in
identifying the chimpanzee adenovirus E1-expressing cell
line, so a novel AAV augmentation assay was developed
for that purpose. This assay is useful to identify E1
function in cell lines made by using the E1 genes of
other uncharacterized adenoviruses, e.g., from other
species. That assay is described in Example 4B below.

According to this invention, the selected
chimpanzee adenovirus gene, e.g., E1, is under the
transcriptional control of a promoter for expression in a
selected parent cell line. Inducible or constitutive
promoters may be employed for this purpose. Among
inducible promoters are included the sheep
metallothionein promoter, inducible by zinc, or the mouse
mammary tumor virus (MMTV) promoter, inducible by a
glucocorticoid, particularly, dexamethasone. Other
inducible promoters, such as those identified in
international patent application WO95/13392, published
May 18, 1995, and incorporated by reference herein, may
also be used in the production of packaging cell lines
according to this invention. Constitutive promoters in
control of the expression of the chimpanzee adenovirus
gene may be employed also. The promoter used to express
E1 as exemplified below is the well-known constitutive
murine PGK promoter.

A parent cell is selected for the
generation of a novel cell line expressing any desired C1
or C68 gene. Without limitation, such a parent cell line
may be HeLa [ATCC Accession No. CCL 2], A549 [ATCC
Accession No. CCL 185], KB [CCL 17], Detroit [e.g.,
Detroit 510, CCL 72] and WI-38 [CCL 75] cells. These
cell lines are all available from the American Type
Culture Collection, 12301 Parklawn Drive, Rockville, MD,
USA. Other suitable parent cell lines may be obtained
from other sources.

The present invention provides an
exemplary cell line which contains and expresses the
chimpanzee C68 or C1 Ad E1 gene, as described in detail
in Examples 3 and 4 below. Briefly described, the entire
chimpanzee adenovirus E1 region was cloned and, by a
series of plasmid manipulations, it was placed under the
control of a murine PGK promoter in a desired shuttle
vector. See Figs. 5A-5G and 6A-6G.

After the desired shuttle vector
containing the adenoviral sequences (i.e., pGPGK-C68 E1
described in Example 3) was transfected into the selected
parental cell line (e.g., HeLa), expression of the E1
gene was detected. Conventional G418 selection as
described in Example 4A was used to generate stable
clones of these E1-expressing cells. The resulting cell
line is thus able to provide chimpanzee Ad E1 gene
products to the replication-defective recombinant virus
(see Example 5) to allow productive infection and
recovery of the recombinant virus.

The E1-expressing cell lines are useful in
the generation of recombinant chimpanzee adenovirus E1
deleted vectors. Cell lines constructed using
essentially the same procedures that express one or more
other chimpanzee adenoviral gene products are useful in
the generation of recombinant chimpanzee adenovirus
vectors deleted in the genes that encode those products.

Further, cell lines which express other
human Ad E1 gene products are also useful in generating
the chimpanzee recombinant Ads of this invention.

III. Recombinant Viral Particles as Vectors

The compositions of this invention
comprise desirable viral vectors that deliver a
functional, normal or therapeutic gene to cells. Such
vectors comprise chimpanzee adenovirus DNA sequence and a
selected heterologous gene operatively linked to
regulatory sequences which direct expression of the gene.
The vector is capable of expressing the gene product in
an infected mammalian cell. The vector is preferably
functionally deleted in one or more viral genes. A
minigene comprises the heterologous gene under the
control of regulatory sequences. Optional helper viruses
and/or packaging cell lines supply to the chimpanzee
viral vectors any necessary products of deleted
adenoviral genes.

The term "functionally deleted" means that
a sufficient amount of the gene region is removed or
otherwise damaged, e.g., by mutation or modification, so
that the gene region is no longer capable of producing
functional products of gene expression. If desired, the
entire gene region may be removed.

WO 98/10087    PCT/US97/15694

The viral sequences, helper viruses, if
needed, and recombinant viral particles, and other vector
components and sequences employed in the construction of
the vectors described herein are obtained as described
above. The DNA sequences of the two chimpanzee
adenoviruses are employed to construct vectors and cell
lines useful in the preparation of such vectors.

Modifications of the nucleic acid
sequences forming the vectors of this invention,
including sequence deletions, insertions, and other
mutations may be generated using standard molecular
biological techniques and are within the scope of this
invention.

A. The "Minigene"

The methods employed for the 
selection of the transgene, the cloning and construction 
of the "minigene" and its insertion into the viral vector
are within the skill in the art given the teachings 
provided herein. By "minigene" is meant the combination 
of a selected heterologous gene and the other regulatory 
elements necessary to transcribe the gene and express the 
gene product in a host cell. The gene is operatively 
linked to regulatory components in a manner which permits 
its transcription. Such components include conventional 
regulatory elements necessary to drive expression of the 
transgene in a cell transfected with the viral vector. 
Thus the minigene also contains a selected promoter which 
is linked to the transgene and located, with other 
regulatory elements, within the selected viral sequences 
of the recombinant vector. 

Selection of the promoter is a
routine matter and is not a limitation of this invention.
Useful promoters may be constitutive promoters or
regulated (inducible) promoters, which will enable
control of the amount of the transgene to be expressed.
For example, a desirable promoter is the
cytomegalovirus immediate early promoter/enhancer [see,
e.g., Boshart et al, Cell, 41:521-530 (1985)]. Another
desirable promoter is the Rous sarcoma virus LTR
promoter/enhancer. Still another promoter/enhancer
sequence is the chicken cytoplasmic β-actin promoter [T.
A. Kost et al, Nucl. Acids Res., 11(23):8287 (1983)].
Other suitable or desirable promoters may be selected by
one of skill in the art.

The minigene may also desirably
contain nucleic acid sequences heterologous to the viral
vector sequences, including sequences providing signals
required for efficient polyadenylation of the transcript
(poly-A or pA) and introns with functional splice donor
and acceptor sites. A common poly-A sequence which is
employed in the exemplary vectors of this invention is
that derived from the papovavirus SV-40. The poly-A
sequence generally is inserted in the minigene following
the transgene sequences and before the viral vector
sequences. A common intron sequence is also derived from
SV-40, and is referred to as the SV-40 T intron sequence.
A minigene of the present invention may also contain such
an intron, desirably located between the promoter/enhancer
sequence and the transgene. Selection of these
and other common vector elements is conventional [see,
e.g., Sambrook et al, "Molecular Cloning. A Laboratory
Manual.", 2d edit., Cold Spring Harbor Laboratory, New
York (1989) and references cited therein] and many such
sequences are available from commercial and industrial
sources as well as from GenBank.

As stated above, the minigene is
located in the site of any selected deletion in the viral
vector, such as the site of the E1 gene region deletion
or E3 gene region deletion, among others which may be
selected.
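The element order described above for the minigene can be summarized in a small data structure. This is only an illustrative sketch: the class name `Minigene`, its field names, and the example labels are hypothetical and are not part of the disclosed constructs. It shows the 5' to 3' arrangement stated in the text (promoter/enhancer, optional intron, transgene, poly-A signal).

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Minigene:
    """Hypothetical container for the minigene elements named in the text."""
    promoter: str                 # e.g. a promoter/enhancer sequence label
    transgene: str                # the selected heterologous gene
    poly_a: str                   # polyadenylation signal label
    intron: Optional[str] = None  # optional intron label

    def element_order(self) -> List[str]:
        """Return element labels in 5' to 3' order as described in the text."""
        order = [self.promoter]
        if self.intron is not None:
            # The intron sits between the promoter/enhancer and the transgene.
            order.append(self.intron)
        order.append(self.transgene)
        # The poly-A signal follows the transgene and precedes viral sequence.
        order.append(self.poly_a)
        return order


# Illustrative labels only; the transgene name is a placeholder.
example = Minigene(
    promoter="CMV immediate early promoter/enhancer",
    transgene="transgene of interest",
    poly_a="SV-40 poly-A",
    intron="SV-40 T intron",
)
layout = example.element_order()
```

The resulting list is then conceptually placed at the site of the selected deletion (for example the E1 or E3 region) in the viral vector.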

B. Construction of the Viral Plasmid Vector

The chimpanzee adenovirus vectors
useful in this invention include recombinant, defective
adenoviruses, that is, chimpanzee adenovirus sequences
functionally deleted in the E1a or E1b genes, and
optionally bearing other mutations, e.g., temperature-sensitive
mutations or deletions in other genes. It is
anticipated that these chimpanzee sequences are also
useful in forming hybrid vectors from other adenovirus
and/or adeno-associated virus sequences. Homologous
adenovirus vectors prepared from human adenoviruses are
described in the published literature [see, for example,
Kozarsky I and II, cited above, and references cited
therein, U. S. Patent No. 5,240,846].

In the construction of useful 
chimpanzee adenovirus vectors for delivery of a gene to 
the human (or other mammalian) cell, a range of 
adenovirus nucleic acid sequences can be employed in the 
vectors. A vector comprising minimal chimpanzee 
adenovirus sequences may be used in conjunction with a 
helper virus to produce an infectious recombinant virus 
particle. The helper virus provides essential gene 
products required for viral infectivity and propagation 
of the minimal chimpanzee adenoviral vector. When only 
one or more selected deletions of chimpanzee adenovirus 
genes are made in an otherwise functional viral vector, 
the deleted gene products can be supplied in the viral 
vector production process by propagating the virus in a 
selected packaging cell line that provides the deleted 
gene functions in trans. 

1. Recombinant Minimal Adenovirus

A minimal chimpanzee Ad virus is
a viral particle containing only the adenovirus cis-elements
necessary for replication and virion
encapsidation, which cis-elements flank the heterologous
gene. That is, the vector contains only the cis-acting
5' and 3' inverted terminal repeat (ITR) sequences of the
adenoviruses of this invention (which function as origins
of replication) and the native 5' packaging/enhancer
domains (that contain sequences necessary for packaging
linear Ad genomes and enhancer elements for the E1
promoter). See, for example, the techniques described
for preparation of a "minimal" human Ad vector in
International Patent Application WO96/13597, published
May 9, 1996, and incorporated herein by reference.

2. Other Defective Adenoviruses

Recombinant, replication-deficient
adenoviruses of this invention may also contain
more than the minimal chimpanzee adenovirus sequences
defined above. These other Ad vectors can be
characterized by deletions of various portions of gene
regions of the virus, and infectious virus particles
formed by the optional use of helper viruses and/or
packaging cell lines, as described herein.

As one example, suitable vectors
may be formed by deleting all or a sufficient portion of
the adenoviral immediate early gene E1a and delayed early
gene E1b, so as to eliminate their normal biological
functions. Replication-defective E1-deleted viruses are
capable of replicating and producing infectious virus
when grown on a chimpanzee adenovirus-transformed
complementation cell line containing functional
adenovirus E1a and E1b genes which provide the
corresponding gene products in trans. Based on the
homologies to known adenovirus sequences, it is
anticipated that, as is true for the human recombinant
E1-deleted adenoviruses of the art, the resulting
recombinant chimpanzee adenovirus is capable of infecting
many cell types and can express a transgene, but cannot
replicate in most cells that do not carry the chimpanzee
E1 region DNA unless the cell is infected at a very high
multiplicity of infection.

As another example, all or a
portion of the adenovirus delayed early gene E3 may be
eliminated from the chimpanzee adenovirus sequence which
forms a part of the recombinant virus. The function of
chimpanzee E3 is believed to be irrelevant to the
function and production of the recombinant virus
particle.

Chimpanzee adenovirus vectors
may also be constructed having a deletion of the E4 gene.
Still another vector of this invention contains a
deletion in the delayed early gene E2a.

Deletions may also be made in
any of the late genes L1 through L5 of the chimpanzee
adenovirus genome. Similarly, deletions in the
intermediate genes IX and IVa2 may be useful for some
purposes. Other deletions may be made in the other
structural or non-structural adenovirus genes.

The above discussed deletions
may be used individually, i.e., an adenovirus sequence
for use in the present invention may contain deletions of
E1 only. Alternatively, deletions of entire genes or
portions thereof effective to destroy their biological
activity may be used in any combination. For example, in
one exemplary vector, the adenovirus sequence may have
deletions of the E1 genes and the E4 gene, or of the E1,
E2a and E3 genes, or of the E1 and E3 genes, or of E1,
E2a and E4 genes, with or without deletion of E3, and so
on. As discussed above, such deletions may be used in
combination with other mutations, such as temperature-sensitive
mutations, to achieve a desired result.
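The relationship between the deletions chosen for a vector and the gene functions that must then be provided in trans can be illustrated with simple set arithmetic. The sketch below is an assumption-laden illustration: the function name `functions_needed_from_helper` and the gene-region labels are hypothetical, and the E3 handling reflects only the statement above that E3 is believed to be irrelevant to production of the recombinant particle.

```python
from typing import Iterable, List

# Regions the text describes as dispensable for producing the particle.
DISPENSABLE = {"E3"}


def functions_needed_from_helper(
    deleted_regions: Iterable[str],
    packaging_line_supplies: Iterable[str],
) -> List[str]:
    """Return deleted regions not complemented by the packaging cell line.

    Whatever remains would have to come from another source, such as a
    helper virus, for infectious particles to be produced.
    """
    required = set(deleted_regions) - DISPENSABLE
    missing = required - set(packaging_line_supplies)
    return sorted(missing)


# Example: an E1/E3/E4-deleted vector grown on an E1-expressing line.
still_missing = functions_needed_from_helper(
    deleted_regions={"E1a", "E1b", "E3", "E4"},
    packaging_line_supplies={"E1a", "E1b"},
)
```

In this example only the E4 function remains uncomplemented, which matches the idea that helper viruses and packaging cell lines together supply the products of deleted genes.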

The minigene containing the
transgene may be inserted optionally into any deleted
region of the chimpanzee Ad virus. Alternatively, the
minigene may be inserted into an existing gene region to
disrupt the function of that region, if desired.

The construction of exemplary
E1-deleted chimpanzee Ad virus vectors is described in
detail in Example 5 below. Desirably, such a vector
contains chimpanzee adenovirus sequences Ad m.u. 0-1.3,
followed by a minigene containing the transgene of
interest (e.g., a therapeutic gene for the correction of
a genetic defect in a patient or a marker gene to
visualize infected cells) and the sequence Ad m.u. 9 to
100 of C1 or C68. These recombinant adenoviruses are
functionally deleted of E1a and E1b.
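Map units can be converted to approximate nucleotide positions, since one map unit is one hundredth of the genome length. The following is a minimal sketch under stated assumptions: the helper name `mu_to_bp` is hypothetical, and the genome lengths (35,524 bp for C1 and 36,519 bp for C68) are taken from the right-end coordinates listed in the cloned-fragment tables of Example 2, not from an independent source. The results are approximations, not the exact junctions used in Example 5.

```python
def mu_to_bp(map_units: float, genome_length_bp: int) -> int:
    """Convert adenovirus map units (0-100) to an approximate bp position."""
    if not 0 <= map_units <= 100:
        raise ValueError("map units must lie between 0 and 100")
    return round(map_units / 100 * genome_length_bp)


# Genome lengths assumed from the right-end coordinates in Tables III and IVA.
GENOME_LENGTH_BP = {"C1": 35524, "C68": 36519}

# Approximate boundaries of the m.u. 0-1.3 and m.u. 9-100 arms for each virus.
arms = {
    virus: {
        "left_arm_end": mu_to_bp(1.3, length),
        "right_arm_start": mu_to_bp(9, length),
    }
    for virus, length in GENOME_LENGTH_BP.items()
}
```

Under these assumptions the left arm ends a few hundred base pairs into the genome and the right arm begins a little past 3 kb, so the minigene replaces roughly the interval in between.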

C. Production of the Recombinant Viral Particle

1. Helper Viruses

Depending upon the chimpanzee
adenovirus gene content of the viral vectors employed to
carry the minigene, a helper adenovirus or non-replicating
virus fragment may be necessary to provide
sufficient chimpanzee adenovirus gene sequences necessary
to produce an infective recombinant viral particle
containing the minigene.

Useful helper viruses contain
selected adenovirus gene sequences not present in the
adenovirus vector construct and/or not expressed by the
packaging cell line in which the vector is transfected.
A preferred helper virus is desirably replication-defective
and contains a variety of adenovirus genes in
addition to the sequences described above. The helper
virus is desirably used in combination with the E1-
expressing cell lines described herein.

Most preferably for C68, the
"helper" virus is a fragment formed by clipping the C
terminal end of the C68 genome with SspI, which removes
about 1300 bp from the left end of the virus. This
clipped virus is then co-transfected into the E1-expressing
cell line with the plasmid DNA, thereby
forming the recombinant virus by homologous recombination
with the C68 sequences in the plasmid.

Because there is no similarly
unique restriction site in the 5' end of C1, to create a
recombinant virus, the SpeI site at position 1733 is
replaced with a unique NotI site, generating the
modified C1 NotI genome of about 35,526 bp. See, e.g.,
Figs. 12A-12F.

Helper viruses may also be
formed into poly-cation conjugates as described in Wu et
al, J. Biol. Chem., 264:16985-16987 (1989); K. J. Fisher
and J. M. Wilson, Biochem. J., 299:49 (April 1, 1994).
Helper virus may optionally contain a second reporter
minigene. A number of such reporter genes are known to
the art. The presence of a reporter gene on the helper
virus which is different from the transgene on the
adenovirus vector allows both the Ad vector and the
helper virus to be independently monitored. This second
reporter is used to enable separation between the
resulting recombinant virus and the helper virus upon
purification.

2. Assembly of Viral Particle and Infection of a Cell Line

Assembly of the selected DNA
sequences of the adenovirus, and the transgene and other
vector elements into various intermediate plasmids and
shuttle vectors, and the use of the plasmids and vectors
to produce a recombinant viral particle are all achieved
using conventional techniques. Such techniques include
conventional cloning techniques of cDNA such as those
described in texts [Sambrook et al, cited above], use of
overlapping oligonucleotide sequences of the adenovirus
genomes, polymerase chain reaction, and any suitable
method which provides the desired nucleotide sequence.
Standard transfection and co-transfection techniques are
employed, e.g., CaPO4 precipitation techniques. Other
conventional methods employed include homologous
recombination of the viral genomes, plaquing of viruses
in agar overlay, methods of measuring signal generation,
and the like.

For example, following the 
construction and assembly of the desired minigene- 
containing viral vector, the vector is transfected in 
vitro in the presence of a helper virus into the 
packaging cell line. Homologous recombination occurs 
between the helper and the vector sequences, which 
permits the adenovirus-transgene sequences in the vector 
to be replicated and packaged into virion capsids, 
resulting in the recombinant viral vector particles. The 
current method for producing such virus particles is 
transfection-based. However, the invention is not 
limited to such methods. 

The resulting recombinant 
chimpanzee adenoviruses are useful in transferring a 
selected transgene to a selected cell. In in vivo 
experiments with the recombinant virus grown in the 
packaging cell lines, the E1-deleted recombinant
chimpanzee adenovirus demonstrates utility in 
transferring a transgene to a non-chimpanzee, preferably 
a human, cell. 

IV. Use of the Recombinant Virus Vectors

The resulting recombinant chimpanzee
adenovirus containing the minigene (produced by
cooperation of the adenovirus vector and helper virus or
adenoviral vector and packaging cell line, as described
above) thus provides an efficient gene transfer vehicle
which can deliver the transgene to a human patient in
vivo or ex vivo.

The above-described recombinant vectors
are administered to humans according to published methods
for gene therapy. A chimpanzee viral vector bearing the
selected transgene may be administered to a patient,
preferably suspended in a biologically compatible
solution or pharmaceutically acceptable delivery vehicle.
A suitable vehicle includes sterile saline. Other
aqueous and non-aqueous isotonic sterile injection
solutions and aqueous and non-aqueous sterile suspensions
known to be pharmaceutically acceptable carriers and well
known to those of skill in the art may be employed for
this purpose.

The chimpanzee adenoviral vectors are
administered in sufficient amounts to transduce the human
cells and to provide sufficient levels of gene transfer
and expression to provide a therapeutic benefit without
undue adverse effects, or with medically acceptable physiological
effects, which can be determined by those skilled in the
medical arts. Conventional and pharmaceutically
acceptable routes of administration include, but are not
limited to, direct delivery to the liver, intranasal,
intravenous, intramuscular, subcutaneous, intradermal,
oral and other parenteral routes of administration. Routes
of administration may be combined, if desired.

Dosages of the viral vector will depend
primarily on factors such as the condition being treated,
the age, weight and health of the patient, and may thus
vary among patients. For example, a therapeutically
effective human dosage of the viral vector is generally
in the range of from about 20 to about 100 ml of saline
solution containing concentrations of from about 1 x 10^
to 1 x 10^^ pfu/ml virus vector. A preferred human
dosage is estimated to be about 50 ml saline solution at
2 x 10^^ pfu/ml. The dosage will be adjusted to balance
the therapeutic benefit against any side effects and such
dosages may vary depending upon the therapeutic
application for which the recombinant vector is employed.
The levels of expression of the transgene can be
monitored to determine the frequency of dosage
administration.

An optional method step involves the co-administration
to the patient, either concurrently with,
or before or after administration of the viral vector, of
a suitable amount of a short acting immune modulator.
The selected immune modulator is defined herein as an
agent capable of inhibiting the formation of neutralizing
antibodies directed against the recombinant vector of
this invention or capable of inhibiting cytolytic T
lymphocyte (CTL) elimination of the vector. The immune
modulator may interfere with the interactions between the
T helper subsets (TH1 or TH2) and B cells to inhibit
neutralizing antibody formation. Alternatively, the
immune modulator may inhibit the interaction between TH1
cells and CTLs to reduce the occurrence of CTL
elimination of the vector.

A variety of useful immune modulators and
dosages for use of same are disclosed, for example, in
Yang et al., J. Virol., 70(9) (Sept., 1996);
International Patent Application No. WO96/12406,
published May 2, 1996; and International Patent
Application No. PCT/US96/03035, all incorporated herein by
reference.

The recombinant chimpanzee adenoviruses
may also be employed as vaccines or immune response-inducing
compositions. The present invention provides a
recombinant replication-defective chimpanzee Ad which can
contain in any of its adenovirus sequence deletions a
gene encoding a desired antigen. The chimpanzee
adenovirus is likely to be better suited for use as a
live recombinant virus vaccine in different animal
species compared to an adenovirus of human origin. The
recombinant adenoviruses can be used as prophylactic or
therapeutic vaccines against any pathogen for which the
antigen(s) crucial for induction of an immune response
and able to limit the spread of the pathogen has been
identified and for which the cDNA is available.

Because the recombinant chimpanzee
adenoviruses described above are deleted in the E1
sequences, the adenoviruses are replication defective and
thus highly unlikely to spread within a host or among
individuals. The recombinant virus lacks oncogenic
potential because the E1 gene, which can function as an
oncogene in some adenovirus strains, has been deleted.

With respect to efficacy, the recombinant, 
replication-defective adenoviruses of this invention are 
expected to be highly efficacious at inducing cytolytic T 
cells and antibodies to the inserted heterologous 
antigenic protein expressed by the virus. This has been 
demonstrated with a recombinant, replication-defective 
human Ad containing a sequence encoding the rabies virus 
glycoprotein as the heterologous gene. See, e.g., Z. Q.
Xiang et al., Virol., 219:220-227 (1996).

As described above and in the examples
below, in the site of the E1 deletion of either of the
two chimpanzee adenoviruses of this invention, and under
control of a promoter heterologous to adenovirus, a
sequence encoding a protein heterologous to the
adenovirus is inserted using techniques known to those of
skill in the art. The heterologous nucleic acid encodes
a protein which is desirably capable of inducing an
immune response to a pathogen when administered to an
immunocompetent host. Such a protein may be a protein
from, among others, rabies virus, human papilloma virus,
human immunodeficiency virus (HIV), and respiratory
syncytial virus (RSV), as well as antigens associated
with diseases of other mammals.

It is also anticipated that the vaccine
method of the present invention may be employed with a
tumor-associated protein specific for a selected
malignancy. These tumor antigens include viral
oncogenes, such as E6 and E7 of human papilloma virus, or
cellular oncogenes such as mutated ras or p53.
Particularly, where the condition is human
immunodeficiency virus (HIV) infection, the protein is
preferably HIV glycoprotein 120 for which sequences are
available from GenBank. Where the condition is human
papilloma virus infection, the protein is selected from
the group consisting of E6, E7 and/or L1 [Seedorf, K. et
al, Virol., 145:181-185 (1985)]. Where the condition is
respiratory syncytial virus infection, the protein is
selected from the group consisting of the glyco- (G)
protein and the fusion (F) protein, for which sequences
are available from GenBank. In addition to these
proteins, other virus-associated proteins, including
proteins which are antigens for disease-causing agents of
other mammals, e.g., domestic animals, horses, farm
animals, etc., are readily available to those of skill in
the art. Selection of the heterologous proteins is not a
limiting factor in the design of vaccine compositions of
this invention.

A recombinant replication-defective
chimpanzee adenoviral vector bearing a gene encoding an
immunogenic protein may be administered to a human or
other mammalian patient, preferably suspended in a
biologically compatible solution or pharmaceutically
acceptable delivery vehicle. A suitable vehicle is
sterile saline. Other aqueous and non-aqueous isotonic
sterile injection solutions and aqueous and non-aqueous
sterile suspensions known to be pharmaceutically
acceptable carriers and well known to those of skill in
the art may be employed for this purpose.

Optionally, a vaccinal composition of the
invention may be formulated to contain other components,
including, e.g., adjuvants, stabilizers, pH adjusters,
preservatives and the like. Such components are well
known to those of skill in the vaccine art.

The recombinant, replication defective
adenoviruses are administered in a "pharmaceutically
effective amount", that is, an amount of recombinant
adenovirus that is effective in a route of administration
to transfect the desired cells and provide sufficient
levels of expression of the selected gene to provide a
vaccinal benefit, i.e., some measurable level of
protective immunity.

Conventional and pharmaceutically
acceptable routes of administration include, but are not
limited to, intranasal, intramuscular, intratracheal,
subcutaneous, intradermal, rectal, oral and other
parenteral routes of administration. Routes of
administration may be combined, if desired, or adjusted
depending upon the immunogen or the disease. For
example, in prophylaxis of rabies, the subcutaneous,
intratracheal and intranasal routes are preferred. The
route of administration primarily will depend on the
nature of the disease being treated.

Doses or effective amounts of the
recombinant replication-defective Ad virus will depend
primarily on factors such as the condition, the selected
gene, the age, weight and health of the animal, and may
thus vary among animals. For example, a prophylactically
effective amount or dose of the Ad vaccine is generally
in the range of from about 100 µl to about 10 ml of
saline solution containing concentrations of from about 1
x 10^ to 1 x 10^^ plaque forming units (pfu) virus/ml. A
preferred dose is from about 1 to about 10 ml saline
solution at the above concentrations. The levels of
immunity of the selected gene can be monitored to
determine the need, if any, for boosters. Following an
assessment of antibody titers in the serum, optional
booster immunizations may be desired.

An additional use of the recombinant
adenovirus vectors described herein resides in their use
as expression vectors for the production of the products
encoded by the heterologous genes. For example, the
recombinant adenoviruses containing a gene inserted into
the location of an E1 deletion may be transfected into an
E1-expressing cell line as described above. The
transfected cells are then cultured in the conventional
manner, allowing the recombinant adenovirus to express
the gene product from the promoter. The gene product may
then be recovered from the culture medium by known
conventional methods of protein isolation and recovery
from culture.

The following examples illustrate the
cloning of the chimpanzee adenoviruses and the
construction and testing of the chimpanzee Ad E1
expressing cell line and the construction of exemplary
recombinant adenovirus vectors of the present invention.
These examples are illustrative only, and do not limit
the scope of the present invention.

Example 1 - Virus Stocks and Propagation

The C1 [ATCC Accession No. VR-20] and C68 [ATCC
Accession No. 594] virus stocks were obtained and
propagated in 293 cells [ATCC CRL1573] cultured in
Dulbecco's Modified Eagles Medium (DMEM; Sigma, St.
Louis, MO) supplemented with 10% fetal calf serum (FCS)
[Sigma or Hyclone, Logan, UT] and 1% Penicillin-Streptomycin
(Sigma). Infection of 293 cells was carried
out in DMEM supplemented with 2% FCS for the first 24
hours, after which FCS was added to bring the final
concentration to 10%. Infected cells were harvested when
100% of the cells exhibited virus-induced cytopathic
effect (CPE), collected, and concentrated by
centrifugation. Cell pellets were resuspended in 10 mM
Tris (pH 8.0), and lysed by 3 cycles of freezing and
thawing.

Virus preparations were obtained following two
ultracentrifugation steps on cesium chloride density
gradients and stocks of virus were diluted to 1 x 10^^
particles/ml in 10 mM Tris/100 mM NaCl/50% glycerol and
stored at -70 °C.

Example 2 - Cloning and Sequencing of Viral Genomic DNA

Genomic DNA was isolated from the purified
virus preparations of Example 1, following standard
methods [see, e.g., M. S. Horwitz et al, "Adenoviridae
and Their Replication", Virology, second edition, pp.
1712, ed. B. N. Fields et al, Raven Press Ltd., New York
(1990); B. J. Carter, in "Handbook of Parvoviruses", ed.
P. Tijssen, CRC Press, pp. 155-168 (1990)] and digested
with a panel of 16 restriction enzymes following the
manufacturers' recommendations. Enzymes that cut the DNA
10-15 times were utilized for cloning of the viral DNA
into pBluescript SK+. Except as noted, all restriction
and modifying enzymes used in this and the following
examples were obtained from Boehringer Mannheim,
Indianapolis, IN.

Manipulation of the genomic DNA to remove the
covalently attached terminal protein was performed
[Berkner and Sharp, Nucleic Acids Res., 11:6003 (1983)].
Taking advantage of the absence of Pac-I restriction
sites, synthetic PacI linkers (New England Biolabs,
Beverly, MA) were ligated onto the ends of the genomic
DNA. Genomic DNA was digested with BamHI, PstI, SalI or
XbaI and the restriction fragments (all but the genomic
terminal fragments) were cloned into pBluescript SK+
(Stratagene, La Jolla, CA). Fragments containing the
left and right genomic termini were cloned into pNEB-193
(New England Biolabs, Beverly, MA) as Pac-I/BamHI or Pac-I/Pst-I
fragments.

The clones generated for C1 and C68 are
illustrated in Figs. 1C and 3C, respectively. The cloned
fragments are described in Table III (C1) [nucleotide
sequence numbers correspond with SEQ ID NO: 1] and Tables
IVA-IVB (C68) [nucleotide sequence numbers correspond
with SEQ ID NO: 2].

Table III

Construct Name           Insert Size   Clone #               Sequence

[pBS:C1-Bam-A]           8477          [illegible], 281      6135-14611
pBS:C1-Bam-B             8253          [illegible], 285      24678-32930
[pBS:C1-Bam-C]           3990          [illegible], 252      17259-21248
pBS:C1-Bam-D             3429          [illegible], 275      21250-24677
pBS:C1-Bam-E             2537          251                   3598-6134
pBS:C1-Bam-F             2203          267, 270              14612-16814
pNEB:C1-Bam-G            1927          516                   1-1927 (left end)
pBS:C1-Bam-H             1632          486, 487              32931-34562
pBS:C1-Bam-I             1538          288-293, 483, 485     2060-3597
pNEB:C1-Bam-J            962           519                   34563-35524 (right end)
[pBS:C1-Bam-K]           288           256, 295, 296, 298    16971-17258
pBS:C1-Bam-L             156           260                   16815-16970
pBS:C1-Bam-M             132           259, 261, 262         1928-2059
pBS:C1-Bam-A/Pst                       423-428               subclone of [illegible]
pBS:C1-Bam-B/HindIII                   429-434               subclone of [illegible]
pNEB:C-1AscB             7937          955                   1-7937 (left end)

Table IVA

Construct Name     Size    Clone #                Sequence

pBS:C68-Pst-A      6768    133, 141               24790-31554
pBS:C68-Pst-B      6713    213-217, 303-305       4838-11550
pBS:C68-Pst-C      5228    219-221                14811-20038
pBS:C68-Pst-D      2739    78, 140                12072-14810
pBS:C68-Pst-E      2647    127, 129, 146, 151     20039-22685
pBS:C68-Pst-F      1951    138, 149               32046-33996
pNEB:C68-Pst-G     1874    502, 505, 506          1-1874 (left end)
pBS:C68-Pst-H      1690    128, 135, 145, 152     23094-24783
pBS:C68-Pst-I      1343    222-224                33997-35339
pNEB:C68-Pst-J     1180    508                    35340-36519 (right end)
pBS:C68-Pst-K      1111    87, 131                2763-3873
pBS:C68-Pst-L      964     132, 136               3874-4837
pBS:C68-Pst-M      888     225-230                1875-2762
pBS:C68-Pst-N      408     320, 321, 323, 324     22686-23093
pBS:C68-Pst-O      380     319, 322               31666-32045
pBS:C68-Pst-P      285     84, 125, 130           11551-11835
pBS:C68-Pst-Q      236     79, 126                11836-12071
pBS:C68-Pst-R      114     82                     31552-31665

Table IVB

Construct Name      Size     Clone #                                        Sequence

BamHI Fragments
pBS:C68-Bam-A       16684                                                   19836-36519 (right end)
pBS:C68-Bam-B       8858     95, 99, 101-103, 119-121, 165, 166, 169, 171   3582-12439
pBS:C68-Bam-C       4410     104, 106, 167, 179, 171                        12440-16849
pBS:C68-Bam-D       2986     195-197                                        16850-19835
pNEB:C68-Bam-E      2041     537, 545                                       1-2041 (left end)
pBS:C68-Bam-F       1540     198-200                                        2042-3581

HindIII Fragments
pBR:C-68-Hind-B     9150     489, 419, 492                                  23471-32620

Cloned restriction fragments were ordered in
the genome by comparison to known adenoviral sequences.
The nucleotide sequence of both viruses was determined
[Commonwealth Biotechnologies Incorporated, Richmond,
VA]. The nucleotide sequence of the top strand of C1 DNA
is reported in SEQ ID NO: 1. The nucleotide sequence of
the top strand of C68 DNA is reported in SEQ ID NO: 2.
Restriction maps were generated using a number of enzymes
and compared to data obtained from restricted genomic DNA
following electrophoresis on agarose gels.
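A simple consistency check on an ordered set of cloned restriction fragments is that each listed size equals the span of its coordinates and that the fragments tile the genome without gaps or overlaps. The sketch below applies that check to the C68 BamHI fragments as listed in Table IVB; the function name `check_tiling` and the tuple layout are hypothetical and serve only as an illustration of the bookkeeping, not as the procedure actually used.

```python
from typing import List, Tuple

# (fragment name, listed size in bp, first nucleotide, last nucleotide)
Fragment = Tuple[str, int, int, int]

# C68 BamHI fragments as listed in Table IVB.
C68_BAMHI: List[Fragment] = [
    ("Bam-A", 16684, 19836, 36519),
    ("Bam-B", 8858, 3582, 12439),
    ("Bam-C", 4410, 12440, 16849),
    ("Bam-D", 2986, 16850, 19835),
    ("Bam-E", 2041, 1, 2041),
    ("Bam-F", 1540, 2042, 3581),
]


def check_tiling(fragments: List[Fragment], genome_length: int) -> List[str]:
    """Return a list of inconsistencies; an empty list means a clean tiling."""
    problems: List[str] = []
    expected_start = 1
    # Walk the fragments in genome order and compare neighbours.
    for name, size, start, end in sorted(fragments, key=lambda f: f[2]):
        if end - start + 1 != size:
            problems.append(f"{name}: size {size} != span {end - start + 1}")
        if start != expected_start:
            problems.append(f"{name}: starts at {start}, expected {expected_start}")
        expected_start = end + 1
    if expected_start - 1 != genome_length:
        problems.append(f"fragments end at {expected_start - 1}, not {genome_length}")
    return problems


issues = check_tiling(C68_BAMHI, genome_length=36519)
```

The same check can be run on the PstI fragments of Table IVA, where any listed coordinate that disagrees with its fragment size would be reported rather than silently accepted.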

Regulatory and coding regions in the viral DNA
sequences were identified by homology to known adenoviral
sequences using the MacVector program (Oxford Molecular
Group) and a Macintosh Quadra 610 computer (Apple
Computer, Cupertino, CA). See Tables I and II. Open
reading frames were translated and the predicted amino
acid sequences examined for homology to previously
described adenoviral protein sequences, Ad4, Ad5, Ad7,
Ad12, and Ad40. See Fig. 2 below.

The C1 E1 coding region is defined as the
sequences between the E1A translation initiation site at
nucleotide 576 of SEQ ID NO: 1 and the E1B translation
termination signal at nucleotide 3507 of SEQ ID NO: 1.
The corresponding sequences in the C68 genome are located
at nucleotides 577 and 3510 of SEQ ID NO: 2. Other open
reading frames and regulatory elements of the viruses are
being examined for homology with other adenoviral
sequences.
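The E1 coordinates given above can be turned into span lengths and approximate map-unit positions with a few lines of arithmetic. This is a sketch under stated assumptions: the helper names are hypothetical, the spans are counted inclusively from the first to the last cited nucleotide, and the genome lengths (35,524 bp for C1, 36,519 bp for C68) are assumed from the right-end coordinates in Tables III and IVA.

```python
def span_length(first_nt: int, last_nt: int) -> int:
    """Inclusive length of a region given its first and last nucleotide."""
    return last_nt - first_nt + 1


def bp_to_mu(position_bp: int, genome_length_bp: int) -> float:
    """Express a nucleotide position in map units (percent of genome length)."""
    return round(100 * position_bp / genome_length_bp, 2)


# E1 coding region boundaries as stated in the text.
E1_REGION = {"C1": (576, 3507), "C68": (577, 3510)}
# Genome lengths assumed from the fragment tables.
GENOME_LENGTH_BP = {"C1": 35524, "C68": 36519}

e1_lengths = {v: span_length(*coords) for v, coords in E1_REGION.items()}
e1_map_units = {
    v: tuple(bp_to_mu(p, GENOME_LENGTH_BP[v]) for p in coords)
    for v, coords in E1_REGION.items()
}
```

Under these assumptions the two E1 regions are nearly the same length, a little under 3 kb each.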

Our preliminary experiments have demonstrated
that human antisera do not neutralize the chimpanzee
adenoviruses in neutralizing antibody assays.

Example 3 - Generation of Plasmid Vectors Expressing the
C1 and C68 E1 Genes

Plasmid vectors were constructed which encode
the C1 and C68 E1 region genes, and these plasmids were
used to generate stable cell lines expressing viral E1
proteins.

A. pGPGK-C68 E1

pGPGK (gift of Guang Ping Gao, University
of Pennsylvania, Philadelphia, PA) is illustrated in Fig.
5A. pGPGK is a 5.5 kb plasmid containing the known murine
PGK promoter (indicated by the arrow on Fig. 5A),
followed by a multiple cloning site, a growth hormone
polyA sequence, an SV40 ori, a neomycin resistance gene,
an SV40 polyA sequence and an ampicillin resistance gene.
The remainder of the plasmid is additional plasmid
sequence.

As shown in Fig. 5B, the 5' end of the
C-68 E1 region was derived from clone 245, which contains
a defective version of the C-68 BamHI-E fragment (2042
base pairs) in pNEB-193; i.e., clone 245 was shown to
lack approximately the first 30 base pairs of the C-68
genomic sequence, a region not included in the final
product of this construction scheme, pGPGK-C68 E1. This
plasmid, pNEB-C68BamE, was digested with BamHI and HindIII
and the 2.1kb fragment was ligated with similarly
digested pGPGK DNA. The resulting plasmid is designated
pGPGK-C68 BamE, illustrated in Fig. 5C.

PCR primers SF-34
(GCAGGTACCGCGAGTCAGATCTACAC) [SEQ ID NO: 4] and SF-35
(CTGTCTGAGCTAGAGCTC) [SEQ ID NO: 5] were designed to
introduce a KpnI restriction site 31 base pairs upstream
of the E1A translation initiation site (nucleotide 577 of
SEQ ID NO: 2). Using clone 245 as template, a 293bp PCR
product was obtained using reagents from Perkin Elmer
(Foster City, CA) under the following conditions: 94°C
x 5 minutes; 25 cycles of 94°C x 1 minute, 54°C
x 1 minute, 72°C x 2 minutes; and a final extension
cycle of 72°C x 7 minutes. The PCR product was
purified and is indicated by the hatched bar in Fig. 5D.
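
As a quick sanity check on the primer design and cycling conditions described above, the following minimal Python sketch locates the KpnI recognition sequence (GGTACC) within SF-34 and totals the programmed hold times of the thermal profile. The function and variable names are illustrative assumptions and are not part of the original work; ramp times between temperatures are ignored.

```python
# Primer sequences exactly as given in the text (SEQ ID NO: 4 and 5).
SF_34 = "GCAGGTACCGCGAGTCAGATCTACAC"
SF_35 = "CTGTCTGAGCTAGAGCTC"
KPNI_SITE = "GGTACC"  # KpnI recognition sequence

# Thermal profile from the text: (label, temperature in deg C, minutes, repeats).
# The three steps with repeats=25 form the cycled block.
PROFILE = [
    ("initial denaturation", 94, 5, 1),
    ("denaturation", 94, 1, 25),
    ("annealing", 54, 1, 25),
    ("extension", 72, 2, 25),
    ("final extension", 72, 7, 1),
]


def site_offset(primer: str, site: str) -> int:
    """Return the 0-based offset of `site` within `primer`, or -1 if absent."""
    return primer.find(site)


def gc_percent(primer: str) -> float:
    """Percentage of G and C bases in the primer."""
    return 100.0 * sum(base in "GC" for base in primer) / len(primer)


def programmed_minutes(profile) -> int:
    """Sum of hold times over all steps and repeats (ramping not included)."""
    return sum(minutes * repeats for _, _, minutes, repeats in profile)


print("KpnI site offset in SF-34:", site_offset(SF_34, KPNI_SITE))
print("SF-34 length / GC%:", len(SF_34), round(gc_percent(SF_34), 1))
print("SF-35 length / GC%:", len(SF_35), round(gc_percent(SF_35), 1))
print("Programmed hold time (min):", programmed_minutes(PROFILE))
```

The KpnI site sits near the 5' end of SF-34, so the site is carried into the amplified product and the downstream primer bases can still anneal to the template.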

The PCR product was digested with KpnI and
NheI, yielding a 253bp fragment, which was purified and
ligated with similarly digested pGPGK-C68 BamE (Fig. 5C)
DNA to yield pGPGK-C68 E1-ATG (Fig. 5E).

The region derived from the PCR step was
sequenced for several isolates and the adenovirus insert
in pGPGK-C68 E1-ATG was shown to match the expected
sequence derived from C-68 genomic DNA. pGPGK-C68 E1-ATG
(Fig. 5E) was digested with BamHI and the linearized
plasmid treated with calf intestinal phosphatase. The
purified/phosphatased backbone was ligated with the
1544bp C-68 BamF fragment isolated from pBS-C68 BamF
(Fig. 5F) to yield the final plasmid, designated
pGPGK-C68 E1 (Fig. 5G).

The C-68 derived sequence in plasmid
pGPGK-C68 E1 ends at the BamHI site corresponding to
nucleotide 3581 of SEQ ID NO: 2 in the C-68 genomic
sequence, which is 80bp downstream of the end of the E1B
coding region. This expression plasmid contains from
about nucleotide 546 to nucleotide 3581 of SEQ ID NO: 2
which encodes E1a and E1b of chimpanzee Ad C68 under the
control of the PGK promoter.
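
The start coordinate quoted above can be cross-checked with simple arithmetic. The Python sketch below uses only numbers stated in the text (the E1A initiation site at nucleotide 577 of SEQ ID NO: 2, the KpnI site introduced 31 base pairs upstream of it, and the BamHI site at nucleotide 3581); the names are assumptions made for illustration.

```python
E1A_ATG = 577           # E1A translation initiation site, SEQ ID NO: 2
KPNI_UPSTREAM_BP = 31   # KpnI site placed this many bp upstream of the ATG
BAMHI_END = 3581        # BamHI site ending the C-68 derived sequence


def insert_start(atg: int, upstream_bp: int) -> int:
    """Coordinate of a site located `upstream_bp` before the ATG."""
    return atg - upstream_bp


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate span."""
    return end - start + 1


start = insert_start(E1A_ATG, KPNI_UPSTREAM_BP)
print("C68-derived insert starts at about nucleotide", start)
print("Approximate insert length (bp):", span_length(start, BAMHI_END))
```

The computed start agrees with the "about nucleotide 546" given for the expression plasmid.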

B. pGPGK-C1 E1

The C1 Ad E1 expression plasmid was
constructed in a manner similar to that described above
for the C68 E1 expression plasmid. Refer to Figs. 6A
through 6G.

The 5' end of the C-1 E1 region is
isolated as a 1.9kb SnaBI - XbaI fragment (Fig. 6B) and
is cloned into pGPGK (Fig. 6A) digested with XbaI and
EcoRV. The resulting pGPGK-C1 (map units 1.3-6.6) (Fig.
6D) is used as the template for PCR. Primers are
designed to introduce a KpnI site just upstream of the C1
E1 region translation initiation codon (E1-ATG) at
nucleotide 578 of the C1 genomic DNA. (See Fig. 6C).

The PCR product is double digested with
KpnI and KspI and ligated with similarly digested
pGPGK-C1 (m.u. 1.3-6.6) to yield pGPGK-C1 E1-ATG.
Partial digestion of pGPGK-C1 E1-ATG (Fig. 6E) with BamHI
and isolation of the full length linear DNA, followed by
XbaI digestion and isolation of the full length band,
followed by ligation with similarly digested pBS-C1 Bam-I
(Fig. 6F) yields the final product, pGPGK-C1 E1 (Fig.
6G). The C-1 derived sequence in plasmid pGPGK-C1 E1
ends at the BamHI site corresponding to nucleotide 3599
in the C-1 genomic sequence, which is 90bp downstream of
the end of the E1B coding region. This expression
plasmid contains from about nucleotide 548 to about
nucleotide 3581 of SEQ ID NO: 1 which encodes E1a and E1b
of Ad C1 under the control of the PGK promoter.

Example 4 - Generation of Cell Lines Expressing
Chimpanzee Adenovirus E1 Proteins

Cell lines expressing viral E1 proteins were
generated by transfecting HeLa (ATCC Acc. No. CCL2) and
A549 (ATCC Acc. No. CCL185) cell lines with either
pGPGK-C1 E1 or pGPGK-C68 E1 of Example 3. These cell
lines are necessary for the production of E1 deleted
recombinant chimpanzee adenoviruses by co-transfection of
genomic viral DNA and the expression plasmids described
above. Transfection of these cell lines, as well as
isolation and purification of recombinant chimpanzee
adenoviruses therefrom, were performed by methods
conventional for other adenoviruses, i.e., human
adenoviruses [see, e.g., Horwitz, cited above and other
standard texts].

A. Cell lines expressing C1 and C68 E1 proteins

HeLa and A549 cells in 10cm dishes were
transfected with 10 µg of pGPGK-C1-E1 DNA or pGPGK-C68-E1
DNA using a Cellphect™ kit (Pharmacia, Uppsala, Sweden)
and following the manufacturer's protocol. 22 hours
post-transfection, the cells were subjected to a three
minute glycerol shock (15% glycerol in Hepes Buffered
Saline, pH 7.5), washed once in DMEM (HeLa) or F12K (A549;
Life Technologies, Inc., Grand Island, NY) media
supplemented with 10% FCS, 1% Pen-Strep, then incubated
for six hours at 37°C in the above described media. The
transfected cells were then split into duplicate 15cm
plates at ratios of 1:20, 1:40, 1:80, 1:160, and 1:320.
Following incubation at 37°C overnight, the media was
supplemented with G418 (Life Technologies, Inc.) at a
concentration of 1 µg/ml. The media was replaced every 5
days and clones were isolated 20 days post-transfection.

Thirty-two A549 and 16 HeLa C1 E1 cell
clones and 40 A549 and 37 HeLa C68 E1 cell clones were
isolated and assayed for their ability to augment adeno-
associated virus (AAV) infection and expression of
recombinant LacZ protein as described below.

B. AAV Augmentation Assay for Screening E1
Expressing Cell Lines

AAV requires adenovirus-encoded proteins
in order to complete its life cycle. The adenoviral E1
proteins as well as the E4 region encoded ORF-6 protein
are necessary for the augmentation of AAV infection.
A novel assay for E1 expression based on AAV augmentation
is disclosed herein. Briefly, the method for identifying
adenoviral E1-expressing cells comprises the steps of
infecting in separate cultures a putative adenovirus E1-
expressing cell and a cell containing no adenovirus
sequence, with both an adeno-associated virus (AAV)
expressing a marker gene and an AAV expressing the ORF6
of the E4 gene of human adenovirus, for a suitable time.
The marker gene activity in the resulting cells is
measured and those cells with significantly greater
measurable marker activity than the control cells are
selected as confirmed E1-expressing cells. In the
following experiment, the marker gene is a lacZ gene and
the marker activity is the appearance of blue stain.
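
The selection step of this assay can be sketched in a few lines of Python. This is only an illustration of the comparison against the control culture: the class and function names, the fold-change cutoff, and the cell counts are all hypothetical, since the text requires only that marker activity be "significantly greater" than in the control cells and does not define a numeric threshold.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Culture:
    name: str
    blue_cells: int      # cells staining blue for the lacZ marker
    cells_counted: int   # total cells examined in the same fields

    @property
    def blue_fraction(self) -> float:
        return self.blue_cells / self.cells_counted


def score_positive(candidates, control, min_fold=5.0):
    """Return names of candidates whose blue fraction exceeds min_fold x control.

    `min_fold` is a hypothetical cutoff chosen for this sketch. If the control
    shows no blue cells at all, any candidate with at least one blue cell passes.
    """
    cutoff = min_fold * control.blue_fraction
    return [c.name for c in candidates if c.blue_fraction > cutoff]


# Placeholder counts for illustration only; these are not measured data.
control = Culture("untransfected control", blue_cells=2, cells_counted=1000)
candidates = [
    Culture("clone_x", blue_cells=150, cells_counted=1000),
    Culture("clone_y", blue_cells=4, cells_counted=1000),
]
print(score_positive(candidates, control))
```

In practice the cutoff would be set by the experimenter, for example from replicate control wells; the sketch only shows where such a threshold enters the decision.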

For example, the cell lines described
above, as well as untransfected control cells (A549 and
HeLa) are infected with 100 genomes per cell of an AAV
vector bearing a marker gene, e.g., AV.LacZ [K. Fisher et
al., J. Virol., 70:520 (1996)] and an AAV vector
expressing the ORF6 region of human Ad5 (AV.orf6) (see
SEQ ID NO: 3). The DNA sequence [SEQ ID NO: 3] of the
plasmid pAV.CMVALP.GRE-ORF6, also called AV.orf6,
generates a novel recombinant adeno-associated virus
(rAAV) containing the LacZ transgene and the Ad E4 ORF 6,
which is an open reading frame whose expression product
facilitates single-stranded (ss) to double-stranded (ds)
conversion of rAAV genomic DNA. In SEQ ID NO: 3, the AAV
5' inverted terminal repeat (ITR) is at nucleotides
53-219; the cytomegalovirus (CMV) enhancer/promoter is at
nucleotides 255-848; the human placenta alkaline
phosphatase cDNA (ALP) is at nucleotides 914-2892; the
SV40 polyadenylation (polyA) signal is at nucleotides
2893-3090; the glucocorticoid dependent (GRE) promoter is
at nucleotides 3114-3393; the Ad5 E4-ORF6 cDNA is at
nucleotides 3402-4286; the SV40 polyA signal is at
nucleotides 4315-4512; and the 3' AAV ITR is at
nucleotides 4547-4713. All other nucleotides are
plasmid-derived. These vectors are incubated in medium
containing 2% FCS and 1% Pen-Strep at 37°C for 4 hours,
at which point an equal volume of medium containing 10%
FCS is added. It should be understood by one of skill in
the art that any marker gene (or reporter gene) may be
employed in the first AAV vector of this assay, e.g.,
alkaline phosphatase, luciferase, and others. An
antibody-enzyme assay can also be used to quantitate
levels of antigen, where the marker expresses an antigen.
The assay is not limited by the identity of the marker
gene. Twenty to twenty-four hours post-infection, the
cells are stained for LacZ activity using standard
methods. After 4 hours the cells are observed
microscopically and cell lines with significantly more
blue cells than the A549 or HeLa cell controls are scored
as positive.
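
The feature coordinates listed above for SEQ ID NO: 3 lend themselves to a simple tabular check. The Python sketch below records each element with its 1-based, inclusive coordinates as stated in the text, computes the element lengths, and confirms that the elements are listed in order without overlap. The data structure and function names are assumptions introduced for illustration.

```python
# Feature coordinates (1-based, inclusive) as listed in the text for SEQ ID NO: 3.
FEATURES = [
    ("5' AAV ITR", 53, 219),
    ("CMV enhancer/promoter", 255, 848),
    ("ALP cDNA", 914, 2892),
    ("SV40 polyA (first)", 2893, 3090),
    ("GRE promoter", 3114, 3393),
    ("Ad5 E4-ORF6 cDNA", 3402, 4286),
    ("SV40 polyA (second)", 4315, 4512),
    ("3' AAV ITR", 4547, 4713),
]


def feature_length(start: int, end: int) -> int:
    """Length in nucleotides of a 1-based, inclusive span."""
    return end - start + 1


def is_ordered_without_overlap(features) -> bool:
    """True if each feature ends before the next one begins."""
    return all(
        prev_end < next_start
        for (_, _, prev_end), (_, next_start, _) in zip(features, features[1:])
    )


for name, start, end in FEATURES:
    print(f"{name:24s} {start:>5d}-{end:<5d} {feature_length(start, end):>5d} nt")
print("Ordered, non-overlapping:", is_ordered_without_overlap(FEATURES))
```

With these coordinates the two ITRs come out the same length, as do the two SV40 polyA signals, and the E4-ORF6 span is a whole number of codons.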

Eight A549 (A-2, 3, 8, 13, 15, 18, 23, 38) and
five HeLa (H-3, 4, 15, 16, 20) cell clones are significantly
positive in the AAV augmentation assay and the three best
of each cell type (A-18, A-13 and H-16, H-4, H-20),
when tested, support the growth of E1 deleted recombinant
C68 viruses.

Four A549 (A-3, 6, 19, 22) and nine HeLa
(H-2, 5-7, 11-16) cell clones are significantly positive
in the AAV augmentation assay and the three best of each
cell type (A-3, A-19, A-22 and H-5, H-12, H-14), when
tested, support the growth of E1 deleted recombinant C1
viruses.

Example 5 - Generation of Recombinant Chimpanzee
Adenoviruses

Recombinant chimpanzee adenovirus vectors are
prepared using the C1 and C68 sequences described herein
and HEK293 cells. The cell lines described in Example 4
may also be used similarly. Plasmids used to construct
C1 and C68 recombinant adenovirus vectors are illustrated
in Figs. 7A through 7K, and 8A through 8K, respectively.
See also Figs. 11A-11K.

A. pC1-CMV-LacZ

pSP72 (Promega, Madison, WI) is modified
by digestion with BglII, followed by filling-in of the
ends with Klenow and ligation with a synthetic 12bp PacI
linker (New England Biolabs, Beverly, MA) to yield
pSP72-Pac (Fig. 7A), which contains a large multiple
cloning site with conventional restriction enzyme
cleavage sites.

pSP72-Pac is digested with PacI and EcoRV
and ligated with the 465bp PacI-SnaBI fragment isolated
from pBSC1-BamG (Fig. 7B) to yield pSP-C1-MU 0-1.3 (Fig.
7C). The CMV promoter-driven LacZ gene is isolated from
pCMV-β (Clontech, Palo Alto, CA; Fig. 7D) as a 4.5kb
EcoRI/SalI fragment and ligated with similarly digested
pSP-C1-MU 0-1.3 DNA to yield pSP-C1-MU 0-1.3-CMV-β.

For the initial step in the isolation of
the C1 Ad map units 9-16 region, pGEM-3Z (Promega,
Madison, WI; Fig. 7F) and pBS-C1-BamI (Fig. 7G) are
digested with BamHI and SphI and the 310bp fragment from
pBS-C1-BamI is ligated with the pGEM-3Z backbone to form
pGEM-C1-MU9-10 (Fig. 7H). C1 map units 10-17 are
isolated from pBS-C1 BamE (Fig. 7I) by digestion with
BamHI. The 2.5 kb fragment is ligated with BamHI-
digested pGEM-C1-MU9-10 to form pGEM-C1-MU9-17 (Fig. 7J).
The 2.9 kb fragment containing C1 map unit 9-17 region is
isolated from pGEM-C1-MU9-17 by digestion with HindIII
and ligated with pSP-C1-MU 0-1.3-β (Fig. 7E) digested
with HindIII to form the final plasmid, pC1-CMV-LacZ
(Fig. 7K).

pC1-CMV-LacZ (Fig. 7K) thus contains C1 Ad
mu 0 to 1.3, followed by the CMV promoter, an SD/SA, the
LacZ gene, a SV40 poly A sequence and C1 Ad mu 9-17, as
well as additional plasmid sequence. This plasmid is co-
transfected into the E1-expressing cell line with a left
terminal clipped C1 Ad fragment (or a replication-
defective C1 Ad helper virus) to produce by homologous
recombination a recombinant chimpanzee adenovirus
carrying the LacZ gene.
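
The map-unit labels used for these plasmids can be related to fragment sizes with a small conversion helper. The Python sketch below assumes a genome length of 36,000 bp, which is only the approximate figure given in the Background for human adenoviruses (100 map units of about 360 bp each); the exact lengths of the C1 and C68 genomes are not assumed here, and the function names are illustrative.

```python
ASSUMED_GENOME_LENGTH_BP = 36_000  # assumption: "approximately 36 kb" genome


def to_map_units(position_bp: float, genome_length_bp: int) -> float:
    """Convert a nucleotide position to map units on a 0-100 scale."""
    if not 0 <= position_bp <= genome_length_bp:
        raise ValueError("position lies outside the genome")
    return 100.0 * position_bp / genome_length_bp


def from_map_units(map_units: float, genome_length_bp: int) -> float:
    """Convert map units (0-100) to an approximate nucleotide position."""
    if not 0 <= map_units <= 100:
        raise ValueError("map units must lie between 0 and 100")
    return map_units * genome_length_bp / 100.0


# The 465bp PacI-SnaBI left-end fragment corresponds to roughly m.u. 0-1.3.
print(round(to_map_units(465, ASSUMED_GENOME_LENGTH_BP), 1))

# The m.u. 9-17 region spans roughly 8 x 360 bp, close to the 2.9 kb fragment.
region_bp = from_map_units(17, ASSUMED_GENOME_LENGTH_BP) - from_map_units(
    9, ASSUMED_GENOME_LENGTH_BP
)
print(round(region_bp))
```

Because the genome length is approximate, the conversion should be read as a consistency check on the plasmid names rather than as exact coordinates.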

C. pC68-CMV-LacZ 

pSP72-Pac (Fig. 8A; also Fig. 7A) is
digested with PacI and EcoRV and ligated with the 465bp
PacI-SnaBI fragment isolated from pBS-C68-BamE (Fig. 8B)
to yield pSP-C68-MU 0-1.3 (Fig. 8C). As above, the CMV
promoter-driven LacZ gene is isolated from pCMVβ
(Clontech; Fig. 8D; also Fig. 7D) as a 4.5kb EcoRI-SalI
fragment and ligated with similarly digested pSP-C68-MU
0-1.3 DNA to yield pSP-C68-MU 0-1.3-CMVβ (Fig. 8E).

For the initial step in the isolation of
the map unit 9-16 region of C68, pGEM-3Z (Fig. 8F; also
Fig. 7F) and pBS-C68-BamF (Fig. 8G) are double digested
with BamHI and SphI and the 293bp fragment from
pBS-C68-BamF is ligated with the pGEM-3Z backbone to form
pGEM-C68-MU9-10 (Fig. 8H). C68 map units 10-16.7 are
isolated from pBS-C68 BamB (Fig. 8I) by digestion with
XbaI, followed by filling in of the ends and digestion
with BamHI. The 2.4 kb fragment is ligated with
BamHI/EcoRV-digested pGEM-C68-MU9-10 to form
pGEM-C68-MU9-16.7 (Fig. 8J). The C68 map unit 9-16.7
region is isolated from pGEM-C68-MU9-16.7 by digestion with
EcoRI, filling in of the ends with Klenow and then
digestion with HindIII. The 2.7 kb fragment is ligated
with pSP-C68-MU 0-1.3-CMVβ (Fig. 8E), digested with
HindIII and PvuII, to form the final plasmid,
pC68-CMV-LacZ (Fig. 8K).

pC68-CMV-LacZ (Fig. 8K) thus contains C68
Ad mu 0 to 1.3, followed by the CMV promoter, an SD/SA,
the LacZ gene, a SV40 poly A sequence and C68 Ad mu
9-16.7, as well as additional plasmid sequence. This
plasmid is co-transfected into the E1-expressing cell
line with another C68 Ad to produce by homologous
recombination a recombinant chimpanzee adenovirus
carrying the LacZ gene.

D. pBS-Notx2

The LacZ gene is removed from either
pC1-CMV-LacZ (Fig. 7K) or pC68-CMV-LacZ (Fig. 8K) by
digestion with NotI, and replaced by the coding sequence
of any desired gene. This cloning step is facilitated by
having the gene of interest flanked by NotI restriction
sites, preferably with the upstream site in the 5'
untranslated region of the gene.

Such a cloning vector is derived from
pBluescript SK+ (Stratagene, La Jolla, CA) by digestion
of SK+ with SalI, followed by filling in of the ends and
ligation with a synthetic 8bp NotI linker (New England
Biolabs, Beverly, MA):

GCGGCCGC
CGCCGGCG

The resulting pBS-Notx2 shuttle vector
(Fig. 4B) is thus designed to facilitate cloning of cDNAs
into pC1-CMV-LacZ (Fig. 7K) and pC68-CMV-LacZ (Fig. 8K;
see also Fig. 4A) as a NotI fragment. pBS-Notx2 has two
NotI sites flanking a number of restriction sites
suitable for cloning the cDNA to be expressed in the
recombinant adenoviruses and the LacZ ORF from
pBluescript is maintained, allowing blue/white screening
of clones in pBS-Notx2.
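
The 8bp NotI linker shown above is self-complementary, which is why a single oligonucleotide can anneal to itself to form the double-stranded linker. The short Python sketch below verifies this property; the helper names are assumptions made for illustration.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def complement(seq: str) -> str:
    """Base-by-base complement, read in the same left-to-right order."""
    return seq.translate(_COMPLEMENT)


def reverse_complement(seq: str) -> str:
    """Sequence of the opposite strand, read 5' to 3'."""
    return complement(seq)[::-1]


def is_palindromic_site(seq: str) -> bool:
    """True if the sequence equals its own reverse complement."""
    return seq == reverse_complement(seq)


NOTI_LINKER = "GCGGCCGC"
print(NOTI_LINKER)
print(complement(NOTI_LINKER))  # bottom strand as drawn beneath the top strand
print("Self-complementary:", is_palindromic_site(NOTI_LINKER))
```

The second printed line corresponds to the lower strand of the linker as drawn in the text.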

E. Homologous Recombination with Helper Virus

To generate the recombinant adenoviruses
from the plasmids described above, the appropriate E1-
expressing packaging cell line, such as 293 cell line or
a cell line of Example 4, is co-transfected with a
replication defective C1 or C68 helper virus, or a left-
end clipped C1 or C68 fragment, as appropriate. These
helper viruses may be deleted of other non-essential
genes. The infected cell line is subsequently
transfected with an adenovirus vector as described above
bearing the transgene of interest. Homologous
recombination occurs between the helper and the plasmid,
which permits the adenovirus-transgene sequences in the
vector to be replicated and packaged into virion capsids,
resulting in the recombinant adenovirus.

Transfection is followed by an agar
overlay for 2 weeks, after which the viruses are plaqued,
expanded and screened for expression of the transgene.
See, for example, Figs. 10A-10D. Several additional
rounds of plaque purification are followed by another
expansion of the cultures. Finally the cells are
harvested, a virus extract prepared and the recombinant
chimpanzee adenovirus containing the desired transgene is
purified by buoyant density ultracentrifugation in a CsCl
gradient. All of the above procedures are known to those
of skill in the art.

F. Another C1 Recombinant Adenovirus

Another set of plasmids used to construct
a C1 recombinant adenovirus is described as follows.
Figs. 11A-11H illustrate the scheme employed to generate
a unique restriction site in the left end of the C1
genome. A unique site is necessary in the procedure
employed in generating a recombinant adenovirus, but C1
has no such site. There are two Spe-I restriction sites,
including one at position 1733, within the E1B 21K coding
region. To replace this Spe-I site with a unique Not-I
site, plasmid pNEB-C1-BamG (Fig. 11A), containing the
left end of the C1 genome, was digested with Spe-I and
Asc-I, and ligated to the 6204 bp Spe-I/Asc-I fragment
from the C1 genome (Fig. 11B). The resulting plasmid,
pNEB-C1-AscI-B (Fig. 11C), is then digested with Spe-I,
filled in with Klenow enzyme and ligated to the synthetic
8bp Not-I linker (Fig. 11D) described above, to yield
pNEB-C1-AscI-B-NotI (Fig. 11E).
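
The site replacement described above (Spe-I digestion, Klenow fill-in, and blunt ligation of the Not-I linker) can be modeled on the top strand with a few lines of Python. This is a simplified sketch: it assumes a single Spe-I site in the input, models only one strand, and uses a made-up 18-nucleotide toy sequence rather than any C1 sequence. Spe-I cuts A^CTAGT, leaving a 4-nucleotide 5' overhang that the fill-in duplicates on both sides of the linker.

```python
SPEI_SITE = "ACTAGT"       # Spe-I recognition sequence, cut as A^CTAGT
NOTI_LINKER = "GCGGCCGC"   # 8bp Not-I linker from the text


def find_sites(seq: str, site: str) -> list:
    """Return all 0-based start positions of `site` in `seq`."""
    return [i for i in range(len(seq) - len(site) + 1) if seq.startswith(site, i)]


def fill_in_and_add_linker(seq, site=SPEI_SITE, cut_offset=1, overhang=4,
                           linker=NOTI_LINKER):
    """Model cutting at a unique site, filling in the 5' overhang, adding a linker."""
    positions = find_sites(seq, site)
    if len(positions) != 1:
        raise ValueError("this sketch expects exactly one site in the sequence")
    i = positions[0]
    left = seq[: i + cut_offset + overhang]   # filled-in left arm ends ...ACTAG
    right = seq[i + cut_offset:]              # filled-in right arm starts CTAGT...
    return left + linker + right


toy = "GATTACAACTAGTCCGGA"  # hypothetical sequence, for illustration only
product = fill_in_and_add_linker(toy)
print(product)
print("Spe-I sites left:", len(find_sites(product, SPEI_SITE)))
print("Not-I sites gained:", len(find_sites(product, NOTI_LINKER)))
```

In this model the product loses the Spe-I site and gains a single Not-I site, and grows by the linker length plus the duplicated overhang.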

This plasmid is digested with Pac-I and
Asc-I and the purified fragment is ligated overnight with
the C1-Asc-I-A fragment (Fig. 11G). The ligation
reaction is extracted with phenol:chloroform:iso-amyl
alcohol, then chloroform, and then 3 µg of sheared salmon
sperm DNA is added and the DNA is ethanol precipitated.
The resuspended DNA is used to transfect 293 cells and
DNA from viral plaques is tested for a Not-I site (Fig. 11H).

G. GFP as a Transgene

Plasmids used to construct exemplary C68
expression plasmids containing the bacterial green
fluorescent protein (GFP) gene are illustrated in Figs.
9A through 9G. To facilitate the cloning
of the GFP gene into the chimp Adeno expression vectors,
pEGFP-1 (Fig. 9A, Clontech, Palo Alto, CA) was digested
with Sma-I and ligated to the previously described 8bp
Not-I linker (Fig. 9B). The resulting plasmid, pEGFP-
Notx2 (Fig. 9C), has the GFP gene flanked by Not-I sites.

The purified pEGFP-Notx2 Not-I fragment is
ligated to Not-I digested pC1-CMV-LacZ (Figs. 7K and 9D)
or pC68-CMV-LacZ (Figs. 8K and 9E) to yield the GFP
expression vectors pC1-CMV-GFP (Fig. 9F) and pC68-CMV-GFP
(Fig. 9G and Fig. 10A), respectively.

Example 6 - Delivery of Transgene to Host Cell

The resulting recombinant chimpanzee
adenovirus described in Example 5 above is then employed
to deliver the transgene to a mammalian, preferably
human, cell. For example, following purification of the
recombinant C68-CMV-GFP virus of Example 5G, human
embryonic kidney 293 cells and A549 cells were infected
at an MOI of 50 particles per cell. GFP expression was
documented 24 hours post-infection.
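
The dose implied by a multiplicity of infection (MOI) expressed in particles per cell is a straightforward product. The Python sketch below shows the calculation for the MOI of 50 stated above; the cell number and the stock concentration are hypothetical placeholders, since neither is given in the text, and the function names are assumptions.

```python
def particles_required(cell_count: int, moi: int) -> int:
    """Total viral particles needed to infect `cell_count` cells at `moi`."""
    return cell_count * moi


def inoculum_volume_ml(cell_count: int, moi: int,
                       stock_particles_per_ml: float) -> float:
    """Volume of virus stock (ml) that delivers the required particle number."""
    return particles_required(cell_count, moi) / stock_particles_per_ml


MOI = 50                     # particles per cell, as stated in the text
CELLS_PER_DISH = 2_000_000   # hypothetical cell number
STOCK_TITER = 1e12           # hypothetical stock concentration, particles/ml

print(particles_required(CELLS_PER_DISH, MOI))
print(inoculum_volume_ml(CELLS_PER_DISH, MOI, STOCK_TITER))
```

Very small computed volumes, as here, would in practice call for diluting the stock before adding it to the cells.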

In vivo studies have tested the
infectivity of the virus in murine liver (tail vein
injection), lung (intratracheal injection) and muscle
(intramuscular injection). Preliminary data indicate
that the C68-CMV-GFP recombinant virus transduces all
three tissues, and GFP expression can be detected.

When administered in vivo, a less severe
immune response is produced by the human immune system
(which is naive to the chimpanzee adenovirus sequences)
than to a human adenovirus construct, thereby permitting
subsequent administration of the same or another vector.

All references recited above are
incorporated herein by reference. Numerous modifications
and variations of the present invention are included in
the scope of the above-identified specification and are
expected to be obvious to one of skill in the art. Such
modifications and alterations to the compositions and
processes of the present invention, such as selections of
different minigenes or selection or dosage of the vectors
or immune modulators are believed to be within the scope
of the claims appended hereto.

GTGACGTTTT


GATGACX3TGG 


TCGTGAGGCG 


GAGTTGGTTT 


GCAAGTTC7C 


GTGGGAAAAG 


180 


T6ACGTCAAA 


CGAGGTGTGG 


TTTGAACACG 


GAAATACTCA 


ATTTTCCCGC 


GCTCTCTGAC 


240 


AG6AAATGAT 


6TGTTTTTGG 


G0G6ATGGAA 


GT6AAAATTC 


CTGATTTTOG 


0606AAAACT 


300 


AAAT6AGGAA 


GTGAATTTCT 


GAGTAATTTC GTOTTTATGA 


GA0G6TG0A& 


TATTTACOOA 


360 


GG0C0GA6TA 


GACTTTGACC 


GATTA0GTG6 


AGGTTTOGAT 


TACOOTGTTT 


TTCACCTAAA 


420 


TTTC0GC6TA 


C6GTGTCAAA 


GTCCTGTGTT 


TTTAOGTAGG 


TGTCAGCTGA 


TCGCTAGAGT 


480 


ATTTAAACCT 


GACGAGTTCC 


GTCAAGAGGC 


CACTCTT6AG 


TGCCA6CXSAG 


AAGAGTTTTC 


540 


TCCTCCXXUIC 


T6C6AGTCAG 


ATCTCCACTT 


TGAAAATGAG 


ACACCTGCGC 


TTCCTGTCCC 


600 


AGGAGATAGT 


CTCCACT6AG 


ACTGGGAATG 


AAATACTGCA 


ATTTGTGGTA 


AATACACTGA 


660 


TGGGAGACGA 


TCCAGAGCCG 


CCTGAGCCAC 


CTTTTGATCC 


TCCTAC9GCTT 


CATGAATTAT 


720 


ATGATTTAGA 


GGTAGACGGA 


CCGGAGGACC 


CTAATGAAAA 


CGACX2TGAAT 


GGGTTTTTTA 


780 


CTGATTCTAT 


GTTATTAGCT 


GCTAATGAGG 


GAGTGGATTT 


AGACCCACCT 


TCTGGAACTT 


840 


TT6ATACTCC 


AGGGGTGATT 


GTGGAAAGCG 


ACATA6ATG6 


GAAAAATTTA 


CCTGATTTGG 


900 


GTGCTGCTGA 


ATTGGACTTA 


TACTGCTATG 


AAGAGG6TTT 


TCCTCAGAGT 


6ATGATGAAG 


960 


ATGTGGAGAA 


TGAGCAGTCA 


ATTCAGACCG 


CCXKX^GGTGA 


GGGAGTGAAA 6CTC0CAGTG 


1020 


ATGGTTTTAA 


GTTGGACTGT 


COGGTGCTTC 


CT6GAGATGG 


CTGTAA6TCT 


TGTGAATTTC 


1080 


AGAG6AAAAA 


TACTGGAGTA 


AAAGAAATAT 


TATGCTOGCT 


TTGTTATATG 


AGAGOGCATT 


1140 


GCCACTTTAT 


TTACAGTAAG 


TGT6TTTAAA 


GTTAAATTTA 


AAGGAACAGT 


AGCTGTTTTA 


1200 


ATAACTCTTG 


AATG6GTGAT 


TTATGTTTTG 


CTGATTTTTA 


TAGGTCCTGT 


GTCTGATGCT 


1260 


GATGAATOGC 


CTTCTCCTGA 


TTCAACTACC 


TCACCTCCTG 


AAATTCGGGC 


ACCCGTTCCT 


1320 


GCAAAOGTAT 


GCAAGCCCAT 


TTCTGTGAAG 


CTTAAGCCTG 


GGAAACGCCC 


TGCTGTGAAT 


1380 


AAACTT6AGG 


ATTT6CTGGA 


6GGTGTGGAT 


GAACXn:TTG6 


ACTTGTGTAC 


COGGAAAATA 


1440 


CCAAGGCAAT 


6AGT6CTCCG 


CACCTGTGTT 


TATCTAATGT 


GACOTCACTG 


TTTTTGTGAG 


1500 


AGTGTCATCT 


AATAAAATTA 


TGTCAGCAGC 


TGAGTGTTTT 


ATTGTTTATT 


GGGTGGGACT 


1560 


TGGGATATAT 


AA6TAGGAGC 


AGACCTGTGT 


GGTTAGCTCA 


cagcag€:ttg 


CTGCGATCCA 


1620 


TGGAGGTTT6 


GGCCATATTG 


GAAGATCTTA 


GGCAGACTAG 


GCAAGTGCTA 


GAAAACGCCT 


1680 






TGGAGATTCT 


GGTTCGGT6G 


tgatctagct 


AGACTAGTCT 




TTAGGATAAA 


GCAGGATTAC 


AGGCAAGAAT 


TTGAAAAGTT 


ATTGGACAAC 


TGTOCAGGAC 


1800 


TTTTTGAAGC 


TCTTAACTTG 


GGCCACCAGG 


CTCATTTTAA 


GGAGAAGGTT 


TTATGAGTTT 


1860 


TGGATTTTTC 


TACCCCTGGT 


AGAACTGCTG 


CTGCTG1!AGC 


TTTCCTTACA 


TTTATATTTG 


1920 


ATAAATGGAT 


CCCACAGACC 


CACTXCAGCA 


AGGGATACGT 


TTTCGATTTC 


ATAGCAGGAG 


1980 


CTTTGTGGA6 


AACATGGAAG 


GCTCGCAGGA 


TGAGGACAAT 


CTTAGATTAC 


TGGCCAGTAC 


2040


A6CCTCTGGG


TGTAGCAGGG 


ATCCTGAGAC 


ACCCACOGAC 


GATGCCAGCG 


GTTTTGGAGG 


2100 


A6GTGCAACA 


AGAGGACAAT 


CCGAGAGCCG 


GCCTGGACCC 


TGCGGTGGAG 


GAGGCGGAGG 


2160 


AGTAGCT6AC 


TTGTTTCCTG 


AACTGCGACG 


GGT6CTTACT 


AGATCTACAA 


CCA6TGGG0G 


2220 


G6ACA6666C 


ATTAAGAGGG 


AAAGGAATOC 


TAGTGOAACT AATCCCA6AT 


CTGAGTTGOC 


2280 


TTTAA6TTTO 


ATGAGTOGCA 


GAOGTCCTGA 


AACTATATGG 


TG6GAT6AG6 


TTCAGAATOA 


2340 


GG6GAGG6AT 6AA0TATCAA TATT6CAA6A GAAATATTCT CTAGAAGA66 


TGAAAAGATG 


2400 


TTG6TTGGA6 


CCTGAGGATG 


ATTGGGAGGT 


TGCCATTAGG AATTATGOCA AOATAGCTTT 


2460 


6AGGCCTGAT 


AAATTGTACA 


GAATTACTAA 


ACGGATTAAT ATTAGAAATG 


CATGCTATAT 


2520 


ATCAGGGAAT 


GGGGCTGAGG 


TAGTGATAGA 


CACTCAGGAC 


A6AACAGTTT 


TTAGATGCT6 


2580 


TAT6ATGGGT 


ATGTGGCCAG 


GGGTGATTGG 


CATGGAGGCG 


GTAACCTTTA 


TGAATGTAAA 


2640 


GTTTA6AGG6 


GATGGGTATA 


ATGGTGTGGT 


TTTTATGGCT 


AATACTAAAT 


TGATTTTGCA 


2700 


TGGTTGTAGC 


TTTTTTGGTT 


TTAATAATAT 


ATGTGTGGAA 


GCTTGGGGGC 


AGGTCAGTGT 


2760 


AAGAGGCTGT 


AGTTTCTATG 


CATGCTGGAT 


TGGAACATCA 


GGCA6GAOCA 


AGAGTCAATT 


2820 


GTCTGTGAAG 


AAAT6TATGT 


TTGAGAGATG 


TAACCTGGGC ATACTAAATG 


AAGGAGAAGC 


2880 


CAGAGTCA6C 


CACTGTGCTT CTTCCX3AAAC TGGCTGTTTC ATGTTGATGA AGGGAAAT6C 


2940 


CAATGTGAAA 


CATAATATGA 


TCTGG66ACC 


CTGAGATQAC 


AGGCCTTATC 


AGATGCT6AC 


3000 


ATGTGCTGGC 


GGACATTGCA 


ATATGCTGGC 


TA0C6TGGAT 


ATTGTTTCTC 


ACOCAOGGAA 


3060 


GAAATGGCCT 


GTTTTGGAAC 


ATAATGTGAT 


6ACCAAATGT 


ACCATGCACG 


TAGGTGGACG 


3120 


GA6AGGAATG 


TTAATGCCAT 


ACCAGTGTAA 


CATGAATAAT 


GTGAAA6TGA 


TGTTOGAACC 


3180 


A6ATGGATTT 


TCCAGAATGA 


GTTTAACAGG 


AATCTTTGAC 


ATGAATCTGC 


AAATATGGAA 


3240 


GATCCTGAGA 


TATGATGACA 


CGAAGTCGAG 


GGTACGCGCA 


TGOGAGTGOG 


GGGGCAAACA 


3300 


TGCCAGGTTC 


CAGCCGGTGT 


GTGXGGATGT 


GACTGAAGAA 


CTAAGGCGA6 


ATCATTTGGT 


3360 


GATTGOCTGC 


ACTGGA60GG 


AGTTCGGTTC 


TAGTGGTQAA 


6AAACT6ACT 


AAAGTGAGTA 


3420 


GTAGTGGGAT 


ACTTTGGATG 


GGCTCTTATG 


TGAATATGGT 


GGACAGATTG 


GGTAAATTTT 


3480 


GTTCTTTCTG 


TCTTGCAGCT 


GTCATGA6TG 


GAAG06CTTC 


TTTT6AGGGG 


GGAGTCTTTA 


3540 


GCCCTTATCT 6ACGGGCCGT CTCCCACCAT 


GGGCAGGAGT 


TCGTCAGAAT 


GTCATGGGAT 


3600 


CCACTGTGGA 


TGGGAGACCA 


GTCCAGCCCG 


CCAATTCATC AACACTGACC 


TATGCGACTT 


3660 


TGAGCTCTTC 


ACCCTT6GAT 


GCAGCTGCAG 


CT6CTGC06C 


TTCTGCTGCC 


GCCAATACOG 


3720 


TCCTTGGAAT 


TGGCTATTAT 


GGAA6CATC6 


TTGCGAATAC 


CA6TTCCTCA 


AATAACCCTT 


3760 


CGACCCTGGC 


TGAGGACAAG 


CTACTTGTTC 


T7TTGGCGGA 


GCTTGAGGCG 


TTGACCCAGC 


3840 


GCCTGGGTGA 


ACTGTCTCAG 


CAGGTGGCCC 


AGCTGCGCGA 


GGAAACTGAG 


TCTGCTGTTG 


3900 


CCACAGCAAA 


GTCTAAATAA 


AGATTAATCA 


ATAAATAAAG 


GAGATACTTG 


TTGATTTTAA 


3960


XCTGTAAT6A


ATCTTTATTT 


GATTTTTCGC 


GCACXK3TATG 


CCCTGGACCA 


CCGGTCTOGA 


4020 


TGATTGAGAA 


CTC6GTGGAT 


TTTTTCCAGG 


ACCCT6TAGA 


GGTOGGATTG 


AATGTTTAOA 


4080 


TAGATGGOCA 


TTAGGCGGTC 


TCGGGG6TGG 


AGATAGCTCC 


ATTOAAGAGC 


CTCAT6CTCC 


4140 


GGG6TAGTAT 


TATAAATCAC 


CCAGTCATAA 


GAAGGTOGGA 


6TGGATGATG 


TTGCAGAATA 


4200 


TCTTTAA60A 


GCAGGCT6AT 


TGCAACTGGG 


AGC0CCTTG6 


TGTATGTGTT 


TACAAATCTG 


4260 


TTAAGCTQAG 


ATGGATGCAT 


TCTGGGTGAA ATTATATGCA TTmGACTG 


TATCTTGAGG 


4320 


TTG6CAATGT 


TGCC6CCCAG 


ATCCGGTCTC GGGTTCATGT 


TATGCAOGAC 


CACCAAGAC6 


4380 


6TGTATC0GC 


TGCACTTAGG 


AAATTTATCA 


TGCAGCTTAG 


ATGGAAAAGC 


ATGAAAAAAT 


4440 


TTGGAGACGC 


CTTTGTGTCC 


GCCCAGATTC 


TCCATGCACT 


CATCCATGAT 


GATAGCGATG 


4500 


GGGCCGTGGG 


CGGCGGCACG 


GGCAAACACA 


TTCCGGTGGT 


CTGACAGATC 


ATAGTTATGC 


4560 


TCCT6AGACA 


GGTCATCATA 


AGCCATTTTA 


ATAAACTTTG 


GGCGGAGGGT 


GCCAGATTGG 


4620 


GGTATAAAT6 


TACCCTOGGG 


CCCCGGAGCA 


TAGTTTCCXT 


CACAOATTTG 


GATTTCCGAG 


4680 


6CTTTCAATT 


CAGAGGGQGG 


GATCATGTCC 


ACCTGAGGGG 


CTATAAAAAA 


TACOGTTTCT 


4740 


G6GGCTGGGG 


T6ATTAACTG 


TGATGATAGC 


AAATTCCTGA 


6GAGCTGTGA 


CTTGCCACAC 


4800 


CCAGTGGGGC 


OGTAAATGAC 


CCOGATTAOG 


GGTTGCAGAT 


GGTAGTTTA6 


GGAGCGGCAG 


4860 


CTGCOGTCCT 


CTC6GAGCAG 


GGGGGCCACT 


TCGTTCATCA 


TTTCCCTTAC 


ATGGATATTT 


4920 


TCCCGCACGA 


AGTCCGTTAG 


GAGGCGCTCT 


CCACCTAGCG 


ATAAAAGTTC 


CTGGAGGGAG 


4980 


6AGAAGT7TT 


TGAGCGGCTT 


TAGCCCGTCA 


GACATGGGGA 


TTTTGGAAAG 


AGTCTGTTGC 


5040 


AAGAGCTCAA 


GCCGGTCCCA 


GAGCTCGGTA 


ATGTGTTCTA 


TGGCATCTCG 


ATCCAGCAGA 


5100 


CCTCCTOGTT 


TCGOGGGTTG 


GGACGGCTCC 


TGGAGTAGGG 


TATCAGAOGA 


TGGGCGTCCA 


5160 


GCGCTGCCAG 


GGTCCGGTCT 


TTCCAGGGTC 


GCAGCGTCCG 


AGTGAGGGTT 


GTT7C0GTCA 


5220 


GAGTGAAGGG 


GTGOGCGCCT 


GGTTGGGOGC 


TTGCGAGGGT GCGCTTCAGG 


CTCATCCTGC 


5280 


TGGTOGAGAA 


COGCTGCOGA 


TCGGCGCCCT 


GCATGTGAGC CATGTAGCAG 


TTTACCATGA 


5340 


GTTOGTAGTT 


GAGTGCCTOG 


GCTGCGTGAC 


CTTTGGOGOG 


6AGCTTACCT 


TTGGAAGTTT 


5400 


TCTGGCAGGC 


AGGGCAG7AC 


AGACACTTGA 


GGGCATATAG 


CTTGGGC60G 


AGGAAGATTC 


5460 


ATTCGGGGGA 


GTATGCATCC 


GOGCCGCAGG 


AGGCGCAGAT GGTTTCGCAT 


TCCACGAGCC 


5520 






GGGTCAAAAA 


CAAGTTTACC 


GCGAT6TTTT 






TCTTACCTTT 


GGTCTCCATG 


AGTTCGTGTC 


CCCGCTGGGT 


GACAAAGAGG 


CT6TCCGTGT 


5640 


CCCCGTAGAC 


CGATTTTATG 


GGCCTGTCCT 


CGAGCGGAGT 


GCCT06GTCC 


TCTTCOTAGA 


5700 


GGAACCCAGA 


CCACTCTGAT 


ACAAAGGCGC 


GCGTCCAGGC 


CAGTACAAAA 


GAGGCCA06T 


5760 


GGGAGGGGTA 


GGGGTCGTTA 


TCAACCAGGG 


GGTCCACCTT 


CTCCACAGTA 


TGTJUU^CACA 


5820 


TGTCCCCCTC 


CTCCACATCC 


AAGAAGGTGA 


TTGGCTTGTA 


AGTGTAGGCC 


A06TGA0CAG 


5880


TTCCAGCCGG


TGGGGTATAA 


aagggggcxk; 


6TCTCTGCTC 


GTCCTCACTG 


TCTTCCXSGAT 


5940 


06CT6TCCA6 


GAGCGCCAGC 


TGTTGGGGTA 


GGTATTCCCT 


TTCGAAGGOG 


GGCATAACCT 


6000 


CTGCACTCA6 


GTT6TCAGTT 


TCTAGGAACG 


AGGAGGATTT 


GATATT6ACA 


6TGCCAGTTG 


6060 


A6AT6CCTTT 


CATGA6ACTC 


TOGTCGATTT GGTCAGAAAA 


GACAATTTTC 


TTGTT6TGAA 


6120 


6CTTGGT06C 


AAAGGATCOG 


TATAGGGCAT TGGATAAAA6 


CTTGGCOATG 


GAGOGGATGG 


6180 


TTTGGTTCTT 


ATCCTTGTCC 


GCACGCTCCT 


TGGCAGCAAT 


GTT0A6TTGG 


ACGTACTOGC 


6240 


G06CCAGGCA 


CTTCCATTCA 


GGAAAGATGG 


TOGTCA6TTC 


ATCTGGGAOG 


ATTCTGACTC 


6300 


6CCA6CCC06 


ATTATGCAGG 


GTGATTAGAT 


CCACACTGGT 


GGCCACCTCG 


CCTCGGAGGG 


6360 


GCTCGTTGGT 


CGAGCAGAGT 


CGACCCCCTT 


TTCTTGAACA 


GAAAGGGGGG 


AGGGGGTCTA 


6420 


GCATGAGTTC 


ATCAGGGGG6 


TCTGCATCCA 


TGGTGAATAT 


TCCTGGGAGC 


AGATCTTTGT 


6460 


CAAAATAGCT 


AATGTGAGCG 


GGGTCATCCA 


AAGCCATCTG 


CCATTCTCGA 


GCTGCCAGCG 


6540 


CGGGTTCATA 


GGGATTGAGT 


GGGGTGCCCC 


ATGGCATGGG 


GTGGGTGAGT 


GCAGAGGCAT 


6600 


ACATGCCACA 


GATGTCATAG 


ACATACAGTG 


GTTCTTOGAG 


6ATGC0GATG 


TAGGTGGGAT 


6660 


AACAG06GCC 


CCCTCTGATG 


CTTGCTCGCA CATAGTCATA 


GAGTTCAT6C 


6AGGGGGCGA 


6720 


GAA6ACCOGG 


GCCCAGATTG 


GTAOGGTTGG 


GTTTTTCAGC 


TCTGTAAA06 


ATCTGGOGAA 


6780 


AGATGGCATG 


GGAATTGGAA 


GAGATGGTAG 


GTCTCTGAAA 


GATGTTAAAA T6GGCAT6A6 


6840 


6CAG6CCCAC 


AGAGTCCCTG 


ACGAAGTGGG 


CATAGGACTC 


TT6CAGCTTG 


GCCACCAGCT 


6900 


OGGCGGTGAC 


GAGCACATCC 


AGGGCGCAGT 


AGTCAAGG6T 


CTCTT6AATG 


ATGTCATAAC 


6960 


CTGGTTGGTT 


TTTCTTTTCC 


CACAGCTCGC 


GGTTGAGGA6 


GTATTCTTOG 


CGATCTTTCC 


7020 


AGTACTCTTC 


GAGGGGAAAC 


CCGTCTTTGT 


CTGCACGGTA 


A6AGCOCAGC 


ATGTAGAACT 


7080 


GATTGACTGC 


CTTGTAGGGG 


CAGCATCCCT 


TCTCCACGGG 


GAGAGAGTAT 


GCTTGGGCGG 


7140 


CCXTGOGCAG 


AGAGGTATGA 


GTGAGGGGAA AGGTGTCCCT 


GACCATGACT 


TTAAGGAACT 


7200 


GATACTTGAA 


6T0GAT6TGA 


TTACAGGCCC 


CCTGTTCCCA 


6AGTTGGAAG 


TCTACCCGCT 


7260 


TCTTGTAGGC 


GG6ATTGGGC 


AAAGCGAAAG 


TAACATOGTT 


GAAGAGTATC 


TTGCCTGCCC 


7320 


TGGGCATGAA 


ATTGCGGGTG 


ATGCGGAAAG 


GCTGGGGCAC 


TTTTGCTOGG 


TTATTGATCA 


7380 


CCTGAGC6GC 


TAGGAOGATC 


TCATCAAAGC 


CATTGATGTT 


GTGCCCCACT 


ATGTACAGTT 


7440 


CTATGAATCG 


A6GGGTGCCC 


TTGACATGAG 


GCAGCTTCTT 


AAGTTCXTCG 


AAAGTTAG6T 


7500 


CTGTGGGGTC 


AGAGAGAGCA 


TAGTGTTCGA 


GGGCCCATTC 


GTGGAGGTOA 


6GGTTOGCAT 


7560 


TGAGGAAGGA 


GGACGAAAGA 


TCCACTGCCA 


GTGCTGTTTG 


TAACTGGTCC 


CGGTACTGGC 


7620 


GAAAATGCTG 


GCOGACTGCC 


ATCTTTTCTG 


GGGTGACACA 


GTAOAAGGTT 


TTGGGGTCCT 


7680 


GCTGCCAGCG 


ATCCCACTTT 


AGTTTCATGG 


CGAGGTCGTA 


GGCGATGTTG 


ACGAGCCGCT 


7740 


CX3TCCCCAGA 


GAGTTTCATG 


ACCAGCATGA AGGGTATGAG 


TTGCTTGCCA 


AAGGACCCCA 


7800 
TCCUGGTGTA


GGTTTCCACA 


TCGTAGGTGA 


GGAAGAGCCT 


TTCCGTGOGA 


GGATGAGAGC 


7860 


C6ATC6G6AA 


GAACTOGATC 


TCCTGCCACC 


AGTTGGAGGA ATGGCTGTTG 


ATGTGAT6GA 


7920 


AOTAOAAATC 


CCTGOGGOGC 


GCCGAGCATT 


CATGCTTGTG 


CTTGTACAGA 


OOGOC9GCAOT 


7980 


ACTOGCAGOG 


CTGCAC6GGA 


TGCACCTCGT 


GAATGAOnG 


TACX3TG0CTT 


CCTTTGAOGA 


8040 


GAAATTTGA6 


T6GGAAGTTG 


AGGCCTGG06 


CTTTTACCTC 


GCTCTCTACT 


ATGTTATCTG 


8100 


CATOGGCCTG 


GCCATCTTCT 


GTCTCGATGG 


T6GTCATGCT 


AACAAGCCXX: 


OGCGGGAGGC 


8160 


AAGTCCA6AC 


CTOGGGACGG 


GAGGGGCGGA 


GCTCGAGGAC 


GAGAGOGOGC 


AGGOCGGAGC 


8220 


TGTCCAGGGT 


CCTGAGACGC 


TGCGGAGTCA 


GGTTAGTAGG 


TAGGGTGAGG 


AGATTGACTT 


8280 


GCATGATCTT 


TTCGAGGGCA 


AGCGGGAGGT 


TCAGATGGTA 


TTTGATCTCC 


ACGGGTCOGT 


8340 


TGGTGGAAAT 


GTCGATGGCT 


TGCAGGGTTC 


CGTGCCCTTT 


GGG06CCACC 


ACCGTGCCCT 


8400 


TGTTTTTCCT 


TTT6GGCGGA 


GGCGGTGGTG 


TTGCTTCTTG 


CATGTTCAGA 


AGOGGTGGCG 


8460 


AGGGCGCGCG 


CCTGGCGGTA 


GAGGCGGCTC 


GGGCCCOGGC 


GGGATGGCTG 


GCAGTOGGAC 


8520 


GTCGG06C0G 


OGCGOGGGTA 


GGTTCTGGTA 


CTGOGCOCTG 


AGAAGACTTG 


OGTGOGCGAC 


8580 


AACGC6GCGG 


TTGAOGTCCT 


GGATCTGTCG 


CCTCTGGGTG 


AAAGCTACCG 


GCCCCGTGAG 


8640 


CTTGAACCTG 


AAAGAGAGTT 


CAACAGAATC 


AATCTOGGTA 


TCGTTGAOOG 


CG6CTTGTCT 


8700 


TAGGATCTCT 


TGTACGTCGC 


COGAGTTGTC 


CTGGTAGGCT ATCT0G6CCSA 


TOAACTGCTC 


8760 


GATTTCTTCC 


TCCTGAAGAT 


CTCCGCXrGCC 


TGCTCTCTCG 


ACGGTGGCOG 


OGAGGTOGTT 


8820 


GGAGAT6G6A 


CCCATGAGTT 


GAGAGAATGC 


ATTCATGCCT 


GCCTCGTTCC 


AGACGCGGCT 


8880 


GTAGACCACG 


GCCCCCTCGG 


GATCTCTCGC 


GCGCATGACC 


ACCTGGGCGA 


GGTTGAGCTC 


8940 


CACGTGGCGG 


GTGAAGACCG 


CATAGTTGCA 


TAGGCGCTGG 


AAGAGGTAGT 


TGAGTGTGGT 


9000 


GGC6ATGTGC 


TCGGTGACGA 


AGAAATACAT 


GATCCATCGT CTCAGCGGCA 


TCTGGCT6AC 


9060 


ATCGCCGAGG 


GCXTCCAAGC 


GTTCCATGGC 


CTC6TAAAAG 


TCCAOGGCAA 


AGTTGAAAAA 


9120 


CTGGGAGTTG 


OGOGCGGACA CGGTCAACTC 


CTCCTCCAGA AGACGGATGA 


GTTCGGCGAT 


9180 


GGTGGOGOGC 


ACCTGGOGCT 


OGAAAGCTCC 


GGGGATTTCT 


TCCTCCTCTT 


CTTCTATCTC 


9240 


CTCTTCGACT 


AACATCTCTT 


CTTCCTC7TTC 


AGGCGGGGGC 


GGAGGAGGAG 


GGGGCACGOG 


9300 


GOQACGCCGG 


CGGOGCACGG 


GCAAACGGTC 


GATGAATCTT 


TCAATGACCT 


CTCCGCGGCG 


9360 


GCGG0GGAT6 


GTCTCGGTGA 


CGGCACGGCC 


GTTTTCCCTG 


GGTCTCAGAG 




9420 


TCCX30GCATC 


TCCCTAAAGT 


GGTGACTGGG 


GGGCTCTCCX; 


TTGGGCAGGG 


ACACAG06CT 


9480 


GATTATGCAT 


TTTATCAATT 


GCCCCGTAGG 


GACTCCGCGC 


AAGGACCTGA 


TC6TCTCAA6 


9540 


ATCCACGGGA 


TCGGAAAACC 


TTTCGACGAA 


AGCGTCTAAC 


CAGTCGCAAT 


06CAAGGTAG 


9600 


GCTGAGCACT 


GTTTCTTGTA 


GGCGGGGGTG 


GCTACAOGCT 


GGGTCGGGGT 


TCTCTATTTC 


9660 


TTCTCCTTCC 


TCCTCTCGGG 


AGGGTGAGAC 


GATGCTGCTG 


GTGATGAAAT 


TAAAATAG6C 


9720 
AOTTCTGAGA


CGGCGGAT66 


TGGCGAGGAG 


CACXAGGTCT 


TTOGGACOGG 


CTTGCTGGAT 


9780 


606CAGGCGA 


TTGGCCATTC 


CCCAAGCATT 


ATCCTGGCAC 


CTGGCXUUSAT 


CTT7GTAGTA 


9840 


6TCTTGCATA 


AGTCGCTCCA 


06GGCACTTC 


TTCTTCGCCC 


GCTCTGCGAT 


GCA7GCGCGT 


9900 


6A6CCCAAAC 


GGGCGGATGG 


GCTGGACAAG 


TGCCAGGTCC 


GCTAOGACCC 


TTTCTGOGAG 


9960 


6ATG6CTT6C 


TGCACCTGGG 


TGAGGGTG6C 


TTGGAAGTOG 


TCAAAGTCCA 


CAAAGOGATG 


10020 


0TA66CCC00 


GTGTTGATGG 


TGTAAGA6CA 


GTTGG0CAT6 


ACTGACCAGT 


TGACTGTCTG 


10080 


GT6CCC0GG6 


06CACAATCT 


CGGTGTACTT 


GAGGOGGGAG 


TAGGOGOGGG 


TOTCAAAGAT 


10140 


6TAAT0GTT6 


CAGGCGOGCA 


CCAGGTACTG 


GTAGCCGATT 


AGAAAATGTG 


GTGGCGGCZG 


10200 


60GGTATAG6 


GGCCATCGCT 


CTGTAGCCGG 


GGCGGCAGGA 


GCGAGGTCTT 


CCAGCATGAG 


10260 


606GTGATAA 


CCGTAGATGT 


ACCTGGACAT 


CCAGGTGATA 


CCGGAGGC6G 


TGGTGGATGC 


10320 


CC6AGGGAAC 


TCGCGTACGC 


GGTTCCAGAT 


GTTGCGCAGC 


GGCATGAAGT 


AGTTCATGGT 


10380 


AGGCACG6TT 


TGGCCCX^TGA 


GGCGCGCACA 


6TCGTTGATG 


CTCTAGACAT 


ACGGGCAAAA 


10440 


ACGAAA6CGG 


TCAGC6GCTC 


GACTCCGTGG 


CCTGGAGGCT 


AAGCGAACGG 


GTTGGGCTGC 


10500 


606TGTACCC 


CGGTTC6AAT 


CTCGGATTAG 


6CTGGAGCCX} 


CAGCTAAOGT 


GGTACIGGCA 


10560 


CTCCOGTCTC 


GACCCAAGCC 


TGCACAAAAC 


CTCCAGGATA 


CGGAGGGGGG 


TCGTTTTTTT 


10620 


TTTTTTTGCT 


TTTCCTGGAT 


GG6AGCCAGT 


GCTGCGTCAA 


GCTTTAGAAC 


GCTCAGTTCT 


10680 


0GG6CCTGG6 


AGTGGCTCGC 


GCCCGTAGTC 


TGGAGAATCA ATOGCCAGGG 


TTGOGTTGOG 


10740 


GCATGCCCC6 


GTTC6AGTCT 


TAGC60GC0G 


6AT0GGC0GG 


TTTCC9GOGGC 


AAACGAGGGT 


10800 


TTGGGAGCCC 


CGTCATTTCT 


AAGACCCCGC 


TAGCOGACTT 


CTCCAGTTTA 


CGGGAG06AG 


10860 


CCCTCTTTTT 


TTTTTTTGTT 


TTTGTTGCCC 


AGATGCATCC 


OGTGCTGCGA 


GAGATGOGCC 


10920 


CCGAGCAACA 


GCCCCCTTCT 


CAGCAGCAGC 


CACAGCAACA 


GCCACAAAAG 


GC7CTTCCTG 


10980 


CTCCTGTAAC 


TACT6CAGCT 


GCAGCCGTCA 


GCGGCGCGGG 


ACAGCCCGCC 


TATGATCTGG 


11040 


ACrrTGOAAGA 


GGG0GA6GGA 


CTGGC6CGTC 


TGGGTGCACC 


ATOGCCOGAG 


06GCACC0GC 


11100 


GGGTGCAACT 


GAAAAA6GAT 


TCTCGCGAGG 


CGTAOGTGCC 


GCAGCAGAAC 


CTGTTCAGGG 


11160 


ACAGGAGOGG 


TGAGGAGCCG 


GAGGAAATGC 


GAGCTTCOOG 


CTTTAAOGCG 


GGTOGCGAGC 


11220 


TGCGTCATGG 


TCTGGACCX3A 


AGACGGGTGC 


TGCGCGATGA 


TGATTTT6AA GTCGATGAAG 


11280 


T6ACAGGGAT AAGTCCTGCT AGG6CACATG TG6CTGCGGC CAACCTA6TA TCAGCCTACG 


11340 


A6GAGAC06T 


GAAGGA6GAG 


C6CAACTTTC 


AAAAATCTTT 


CAACAATCAT 


GTGCGCACCC 


11400 


TGATTGCCCG 


CGAGGAGGTG 


ACACTGGGTC 


TAATGCACCT 


GTGG6ACCTG 


ATGGAAGCTA 


11460 


TTACCCAGAA 


CCCCACCAGC 


AAACCTCTGA 


CCGCTCAGCT 


GTTTCTAGTG 


GTGCAAGATA 


11520 


GCAGA6ACAA 


T6AGGCATTT 


AGGGAGGCGC 


TGTTGAACAT 


CACTGAGCCC 


GAGGGGAGAT 


11580 


GGTTGTATGA 


TCTTATCAAT 


ATTCTGCAAA 


GTATCATAGT 


GCAAGAAOGT AGCCTGGGTC 


11640 
TGOCTGAGA]^


GGTOOCT6CT 


ATTAACTACT 


CGGTCTTAAG 


CCTGGGCAAG 


CACTACGCTC 


11700 


0CAA6ATCTA 


TAAAACCCCA 


TACGTACCTA 


TAGACAAGGA 


GGTTAAGATA 


OATGGGTTTT 


11760 


ATAT0C6GAT 


GACTCTGAAG 


GTGCTGACCT 


TGAGTGAOGA 


TCTGGOAGTG 


TACOGCAAOG 


11820 


AGAGOATGCA 


CXX^TGCAGTG 


AGCGCCAGCA 


GAAGGCiGTQA 


OCTOAGOGAG 


AGA6AACTTA 


11880 


TGGACA6CTT 


OCAAAGAGCT 


CTGACGGGGG 


CTGGAACCOA 


GOGGGAOAAC TACTTTGACA 


11940 


TGG6A6CG6A 


TTT6CAATGG 


CAGCCCAGTC 


GCAGGGCOCT 


GGACGCAGCA GGGTATGAGC 


12000 


TTCCTTACAT 


AGAAGAGGOG 


GATGAA6GCC 


ATGAOGAGGA 


GGGCGAGTAC 


CTG6AAGACT 


12060 


6ATG0C6C36A 


CCATCCATAT 


TTTTGTTAGA 


TGCAGCAACA 


GCCACCTCCT 


GATCCOGCAA 


12120 


TGCGGGC6GC 


GCT6CAGAGC 


CAGCCGTCCG 


GCATTAACTC 


CTCGGACGAT 


TGGACCCAGG 


12180 


CCAT6CAAC6 


GATCATGGCG 


CTGACGACCC 


GCAACCCCGA 


AGCCTTTAGA 


CAGGAACCCC 


12240 


AG6CCAAC06 


CCTTTCTGCC 


ATCCTGGAGG 


CCGTAGTGCC 


CTCCCGCTCC 


AACCCCACCC 


12300 


AC0A6AAGGT 


CCTGGCTATC 


GTGAACGCGC 


TGGTGGA6AA 


CAAAGOCATA 


CGTCCCSGATG 


12360 


AGGCTGOACT 


GGTATACAAT 


GCCCTATTGG 


AGCGC6TA6C 


COGTTACAAC 


AGCAGCAAOG 


12420 


TGGAGACGAA 


CCTTGACCGG ATGGTGACCG 


ATGTGCGC6A 


GQCTGTGTCT 


GAGOGCOAGC 


12480 


GGTTCCAGCS6 


A6ACTCCAAT 


CTAGGGTCGC 


TGGTGGCGTT 


6AAC6CCTTC 


CTCAGCACCC 


12540 


AGCCTGCCAA 


CGTGCCTCGC GGCCAGCAAG 


ACTACAGAAA 


CTTTCTAAGT GGATTAAGAC 


12600 


TGATGGTGGC 


OGAAGTCCCT 


CAAAGTGAGG 


TGTACCAGTC 


OGGGCCAGAC 


TACTTTTTOC 


12660 


AGACCAGCAG 


ACAGGGCTTG 


CAGACAGTGA 


ACCTGAGCCA 


GGCTTTTAAG 


AACCTGAATG 


12720 


GTCTGTGGGG 


AGTGCGTGCC 


CCAGTAGGAG 


ATCGGGCAAC 


CGOXSTCTAGC 


TTGCTAACCC 


12780 


CCAACTCCCG 


CCTACTACTG 


CTCTTGGTAG 


CCCCATTCAC 


TGACAGOGGT 


AGCATCGACC 


12840 


GCAATTCTTA 


CTTGGGCTAT 


TTGTTGAACC 


TGTAT06C6A 


GGCCATAGGG 


CAAACTGAGG 


12900 


TAOATGAGCA 


AACCTATCAA 


GAAATTACCC 


AAGTGAGC06 


OGCTCTOGGT 


GAGGAAGACA 


12960 


CTGGCAGCTT 


GGAAGCCACC 


TTAAACTTCT 


TGCTGACCAA 


CCGGT0GCA6 


AAGATCCCTC 


13020 


CTCAGTATGC 


GCTTAC0GC6 


GAGGAGGAAC 


GAATCCTGAG 


ATACGTGCA6 


CAGAGC6TGG 


13080 


OACTTTTCCT 


AATGCAGGAG 


GGGGCGACTC 


CTACTGCTGC 


6CTAGATATG 


AGAGCCOOAA 


13140 


ACATQGAGOC 


CA6CATGTAT 


GCCAGTAACA 


GGCCTTTTAT 


GAATAAACTA 


CTAGACTACT 


13200 


TAGACAGGGC 


GGCTGCTATG 


AACTCTGATT 


ATTTCACGAA 


TGCTATACTG AACCCCCATT 


13260 


GGCTGCCCCC 


ACCTGGGTTC 


TATACGGGCG 


AGTATGACAT 


GCCCGACCCC 


AATGACGGGT 


13320 


TTTTATGGGA 


CGATGTGGAC 


AGTAGTGTTT 


TCTCCCCGCC 


TCCTGGTTAT 


AACACTTGGA 


13380 


A6AAGGAAGG 


GGGCSGATAGA 


AGGCATTCTT 


CCGTATCGCT 


GTCCGGGGCA 


ACX^GGTGCTG 


13440 


TOGCAGOGGT 


GCCCGAGGCC 


GCAAGTCCTT 


TCCCTAGTTT 


GCCATTTTCG 


CTAAACAGTG 


13500 


TA06CAGCAG 


TGAGCTGGGC 


AGGATCACGC 


GTCCGCGCTT 


GATGGGOGAG 


GAGGAGTACT 


13560 
TOAATOACTC


GCTGTTGAGG 


CCAGAGCGGG 


AGAAGAACTT 


CCCCAATAAC 


GGGATAGAGA 


13620 


OCCTGGTGGA 


TAAGATGAGC 


CGCTGGAAGA 


CGTACGGGCA 


CGAGCACAGG 


GACGAGCCCC 


13680 


GAGCTA6GA6 


CAGC6CCGGC 


GCCCX3TAGAC 


GCCAGCGGCA 


CGATAGGCAG 


OGGGGACTTG 


13740 


TGTGGGA06A 


TGA6GATTCC 


GCOGAOGACA GCAGOGTGTT GGACTTGGGT GGGAGTGGT6 


13800 


6TGGTAA0CC 


6TTT6CTCAC 


CTGCGCCCCC GOGTTGGGOG 0CT6ATGTAA AAACCX^AAA 


13860 


TAAATG6TAC 


TCACCAAGGC 


CATGGCGACC AGCGTGOGTT 


OGTTTCTTCT 


CTGTT6TATC 


13920 


TAGTAT6ATC 


AGGOGAACCG 


TGCTAGGAGG 


AGCGGTGGTG 


TATCOGGAGG 


GTCCTCCTCC 


13980 


TTOOTATOAA 


AGCGTGATGC 


AGCAGGCGGC 


GGCGGCGGOG 


AT6CAGCGAC 


CACTGGAGGC 


14040 


TCCCTTTOTC 


CCCCCTCGGT 


ACCTGGCACC 


TACGGAGGGG 


AGAAAGAGCA 


TTCGTTACTC 


14100 


GGA6CTGGCA 


CCATTGTATG 


ATACCACCCG 


GTTGTATTTG 


GTGGAGAACA 


AGTOGGCGGA 


14160 


CATCGCCTCA 


CTGAACTATC 


AGAACGACCA 


CAGCAACTTC 


CTCACCACGG 


TGGTGCAAAA 


14220 


CAAT6ACTTT 


ACCCCCACGG 


AGGCCAGCAC 


CCAGACAATC 


AACTTTGACG 


AGCGGTOGCG 


14280 


AT66GGTGGT 


CAGCTGAAGA 


CTATCATGCA 


CACCAACATG 


CCCAAOGTGA 


AOGAGTACAT 


14340 


6TTTAGCAAC 


AA6TTCAAAG 


CTCGGGTGAT 


GGTGTCGA6A 


AAGGCTOCTG 


AAGGTGTCAC 


14400 


AGTA6ATGAC 


AATTAT6ATC 


ACAAGCAGGA 


TATTTTGGAA 


TATGAGTGGT 


TTGAGTTTAC 


14460 


TCTACGGGAA 


GGGAACTTCT 


CAGCCACAAT 


GACCATT6AC 


CTAATGAACA ATGCCATGAT 


14520 


TGATAATTAC 


CTTGAAGTGG 


GCAGACAGAA TGGAGTGTTG GAGAGTGACA TTGGTGTTAA 


14580 


ATTTGAGACC 


AGGAACTTTA 


AACTGGGTTG 


GGATCOGGAA ACTAAGTTGA 


TTATGCCTGG 


14640 


GGTTTACACC 


TATGAGGCAT 


TCCATCCTGA 


CATTGTATTG 


TT6CCTGGTT 


GTGGGGTTOA 


14700 


CTTTACT6AA 


AGTOGCCTTA 


GTAACTTGCT 


TGGTATCAGG 


AAAAGACACC 


CATTCCAGGA 


14760 


GGGTTTTAAG 


ATCTTGTATG 


AGGATCTTGA 


AGGGGGTAAT 


ATCCCAGCCC 


TTTTGGATGT 


14820 


AGAAGCCTAT 


GAGAACAGTA 


AGAAAGAACA 


AGAAGCCAAA 


ACAGAAGCGG 


CTAAAGCTGC 


14880 


TGCTATTGCT 


AAAGCCAATA 


TAGTTGTCAG 


CGACCCTGTC 


AGGGTGGCTA 


ATGCCGAAGA 


14940 


AGTCAGAG6A 


GACAACTATA 


CAGCTACATC 


TGTTGGAACT 


GAAGAATCGC 


TATTGACTAC 


15000 


TGCTGOGACT 


GGAACCAAAA 


ATACAGAGAC 


AGGACTCACT 


ATGAAACCTG 


TAGAAAAAGA 


15060 


TAGCAAGAGT 


AGAAGTTACA 


ATGTCTTGGA AGATAAAGTT AATACAGCCT 


ACCGGAGCTG 


15120 


GTATCTGTCC 


TACAACTATG 


GOGACCCTGA AAAAGGAGTC 


CGTTCCTGGA 


CACTGCTCAC 


15180 


GACCTCGGAT 


GTCACCTGTG 


GAGCAGAGCA 


GGTGTACTG6 


TCACTTCCAG 


ACATGATGGA 


15240 


6GACCCTGTC 


ACATTCCGTT 


CCACGAGACA 


AGTCAGCAAC 


TATCCAGTGG 


TAGGTGGAGA 


15300 


GCTCAT6CCA 


GTCTTCTCAA 


AAAGTTTCTA 


CAACGAGCAA 


GCCGTGTACT 


CCCAGCAGCT 


15360 


TCGCCAGTCC 


ACCTCGCTCA 


CGCACGTCTT 


CAACCGCTTC 


CCTGAGAACC 


AGATCCTCAT 


15420 


COGCCCGCCA 


GCGCCCACCA 


TTACCACCGT 


CAGTGAAAAC 


GTTCCTGCTC 


TGACAGATCA 


15480 
C6GGACCCT6


CGGTTGCGCA 


GCAGTATCCG 


GGGAGTCCAG 


C6CGTGACOG 


TTACTGAOGC 


15540 


OIGAOGCOGC 


ACCTGCCCCT 


AOGTCTACAA 


GGCCCTGGGC 


ATAGTOGCGC 


OGOGOGTCCT 


15600 


TTGMOCOOC 


ACTTTCTAAA AAAAAAAAAA 


TGTCGATTCT 


TATCTCACCT AGTAATAACA 


15660 


CCSG6TTGGG0 


CC1!GCGC60G 


CCAAGCAAGA 


TGTAOGGMG 


TGCT06GAAA 


CGCTCTACAC 


15720 


AGGACCCT6T 


GCSGAGTGCGC 


GGACACTTCC 


GOGCTCCATG 


QOQCGCCC^C 


AAG0G006TA 


15780 


TCCGCJICTAG 


AAOCACCGTC 


GATGATGTGA 


tcgacx:aggt 


GGTG6CCGAT 


6CTCGTAATT 


15840 


ATACTOCTAC 


TGCACCTACA 


TCTACTGTGG 


ATGCAGTTAT 


TGACAGCGTA 


GTAGCTGA06 


15900 


CCC6C6CCTA 


TGCTCGCCGG 


AAGAGCAGGC 


GGAGACGCAT 


CGCCAGG06C 


CACCGGGCTA 


15960 


CTCCCGCTAT 


GCGAGCAGCA 


AGAGCTTTGC 


TACGGAGAGC 


CAAACGCGTG 


GGGCGAAGAG 


16020 


CTATGCTTAG 


AGCAGCCAGA 


CGCGCGGCTT 


CAGGTGCCAG 


TGCTGGGAGG 


TCCCGCAGGC 


16080 


GC6CA6CCAC 


TCCAGCAGCA 


GCGGCCATTG 


CCAACATGGC 


CCAACCGC»A 


AGAGGCAATG 


16140 


TGTACTGGGT 


GGG06ACGCC 


ACCACCGGCC 


AGCGCGTCCC 


CGTGCGCACC 


GOTCCCCCTC 


16200 


6CTCTTA0AA 


GATACTGAGC 


AGTCTCCGAT 


GTTGTGTCCC 


AGOGAGGATG 


TCCAAGOGCA 


16260 


AATACAAGGA 


AGAGATGCTC 


CAGGTCATCG 


CGCCTGAAAT 


CTACX36T006 


CGGGTGAAGG 


16320 


ATGAAAAAAA 


GCOCOGCAAA 


ATCAAGCGGG 


TCAAAAAGGA 


CAAAAAGGAA 


GAA6ATG60G 


16380 


AT6ATGGTCT 


GGTGGAGTTT 


GTGCGCGAGT 


TCGCCCCAA6 


G06G0GTGT6 


GAGTGGCGT6 


16440 


GA06CAAAGT 


G06GCCTGTG 


CTGAGACCTG 


GAACCACGGT 


GGTCTTTAOG 


CCCGG0GA6C 


16500 


6CTCCA6CAC 


TGCTTTTAAG 


CGGTCCTATG 


ATGAGGTGTA 


TGGGGATGAT 


GATATTCTGG 


16560 


AGCAGGCGGC 


TGACCGCCTG 


GGCGAGTTTG 


CTTATGGCAA 


GOGCTCCCGC 


TCCAGTCCCA 


16620 


AGOAGGAGGC 


GGTGTCCATT 


CCCTTGGACA 


ATGG6AATCC 


CACCCCTAGC 


CTCAAGCCAG 


16680 


TCACCXTFCCA 


GCAAGTGCTG 


CCCGTGCCTC 


CACGCAGAGG 


CATCAAGCGA 


GAGGGTOAGG 


16740 


ATCSTGTATCX? 


GACTATGCAA 


TTGATGGTGC 


CCAAGCGCCA 


60GGCTGGAG 


GACGTGCTGG 


16800 


AGAAAATGAA 


AGTGGATCCC 


GATATACAAC 


CTGAGGTCAA 


AGTGAGACCC 


ATGAAGCAGG 


16860 


TGG06CCAGG 


TTTGGGAGTA 


CAAACCGTAG 


ACATCAAGAT 


TCCAACOGAG 


TCCATGQAAG 


16920 


TCCAAACCGA 


ACCTGCAAAG 


CCCACAACCA 


CCTCCATTGA 


GGTACAAACG 


6ATCCCTOOA 


16980 


T6TGAGCACC 


OGTTACAACT 


CCAGCTGCCG 


TCAACACCAC 


TCGAAGATCC 


OGGOGACAGT 


17040 




AAGTTTGCTG 


ATGCCAAATT 


ATGCTCTGCA 


CCCATCTATT 


ATTCCAACTC 


17100 


C6GGTTACC6 


AGGCACTCGC 


TACTACCGCA 


GCCGGAGCAG 


TACTTCCOGC 


CGTC6CC3GCA 


17160 


AAAGACCTAC 


AAGTCGTAGT 


CACCGTCGTC 


GCCGTCGCCC 


CACCAGCAAT 


CTGACTCCCG 


17220 


CTGCTCTGGT 


GC3GGAGAGTG 


TATCGCGATG 


GCCGCGCGGA 


TCCCATGACG 


TTGCCACGCG 


17280 


TAOGCTACCA 


CCGAAGCATC 


ACAACTTAAC 


GACTGTTGCC 


GCTGCCTCCT 


TGCAGATATG 


17340 


GCCCTCACTT 


6C0GCCTTCG 


TGTCCCCATT 


ACTGGCTACC 


GAGGAAGAAA 


CTGGCGCOGT 


17400 
AGAAGAGGGA


TGTTGGGGCG 


TCGGATGCGA 


CGCCACAGGC 


GGCGGOGCGC 


TATCAGCAAG 


17460 


A6GCTGGG6G 


GT6GCTTTCT 


GCCTGCTCTG 


ATCCCCATCA 


TAGCXGCGGC 


GATCGGGGCG 


17520 


ATACCAG6CA 


TAGCTTCCGT GG0GGXTGA6 


GCCTCGCA6C 


GCCACTGACA 


TTGGAAAAAC 


17580 


TTATAAATAA 


AACAGAAT66 ACTCT6ATGC 


TCCTGGTCCT 


GTGACTATGT 


TTTTGTAGA6 


17640 


ATG6AAGACA 


TCAATTTTTC ATOCCTGGCT 


CC6C6ACACG 


GCACGAGGCC 


GTACAT6GGC 


17700 


ACCTG6AGC0 


ACAT066CAC 


CAGCCAACT6 


AAOGGGGGOG 


CCTTGAATTG 


GAGCAGTATC 


17760 


TG6AGCG66C 


TTAAAAATTT 


TGGCTCTACC 


ATAAAAACCT 


ATGGGAACAA 


AGCTTGGAAC 


17820 


AGCAGCACAG 


GGCAGGCACT GAGAAATAA6 


CTTAAAGAAC 


AAAACTTCCA 


ACAGAAGGTG 


17880 


GTTGATGGGA 


TCGCCTCTGG 


TATTAATGGG 


GTGGTGGATC 


TGGCCAACCA 


GGCCGTGCAG 


17940 


AAACAGATAA 


ACAGCCGCCT 


GGACCCGCCG 


CCGTCA6CCC 


CGGGTGAAAT 


GGAAGTGGAG 


18000 


GAAGATCTCC 


CTCCCCTTGA 


AAAACGGGGC 


GACAAGCXSTC 


CGCGCCCCGA 


TCTGGAGGAG 


18060 


ACACTAGTCA 


CACGCTCAGA 


CGACCCGCCC 


TCCTACGAGG 


AGGCAGTGAA 


GCTTGGAATG 


18120 


CCCACCACCA 


GGCCTGTAGC 


CCCCAT6GCT 


ACCGGGGT6A 


TGAAACCTTC 


7CAGTCACAC 


18180 


C6ACCCGCTA 


CCTTGGACTT GCXTPCCTCCC 


CCTGCTGCTG 


CAGCGCCTGC 


TCGCAAGCCT 


18240 


GTCGCTACCC 


OGAAGCCCAC 


CAC0GTACA6 


CCOGTCGCCG 


TAGCCAQACC 


GCGTCCTGGG 


18300 


6GCGGCCCAC 


GACOGAATTC 


AAAC7GGCAG 


AGTACTCTGA 


ACAGCATCGT 


TGGTCTGGGC 


18360 


GTGCAAAGTG 


TAAAAOGCOG 


TCGCTGCTTT 


TAAATTAAAT 


ATG6AGTAGC 


GCTTAACTTG 


18420 


CCTGTCTGTG 


TGTATGTGTC 


ATCATGAOGC 


CGCTGCCGCA 


GCAACAGCAG 


AGGAGAAAGG 


18480 


AAGAGGTCGC 


6C6CCGAGGC 


TGAGTTGCTT 


TCAAGAT66C 


CACCCCATCG 


ATGCTGCCCC 


18540 


AGTGGGCATA 


CATGCACATC 


GCCGGACAGG 


ATGCTTCGGA 


GTACCTGAGT 


CCGGGTCTGG 


18600 


TGCAGTTCGC 


CCGCGCCACA 


GACACCTACT 


TCAATCTGGG 


GAACAAGTTT 


AGGAACCCCA 


18660 


CCGTGGC6CC 


CACCCATGAT 


GTGACCACCG 


ACCGCAGTCA 


GCGGCTGATG 


CTGCGCTTTG 


18720 


TACCOGTTGA 


CCGGGA6GAC 


AATACCTACT 


CATACAAAGT 


TCGATACACC 


TTGGCTGTGG 


18780 


6CGACAACAG 


AGTGCTGGAT 


ATGGCCAGCA 


CTTTCTTTGA 


CATTCGGGGT 


GTGTTGGATA 


18840


GAGGCCCTAG 


CTTCAA6CCA 


TATTCT6GCT 


CT6CTTACAA 


CTCATTGGCC 


CCTAAGGGCG 


18900 


CTCCCAATAC 


ATCTCAGTGG 


CTTGATAAGG 


GAGTCACAAC 


CACT6ATAAT 


AATACTGAAA 


18960 


A06GAGATGA 


AGAAGAT6AA 


GTTGCCS6AGG 


AAGGGGAAGA 


AGAAAAACAA 


6CTACATACA 


19020 


CTTTT6GCAA 


TGCXvCCAGTA 


AAA6C0GAAG 


CTGAAATTAC 


AAAA6AAGGA 


CTGCCAATAG 


19080 


GTTTGGAA6T 


TCCATCTOAA 


GGT6ACCCTA 


AACCCATTTA 


TGCTGATAAA 


CTGTATCAGC 


19140 


CAGAACCTCA 


GGTGGGAGAG 


GAATCTTGGA 


CTGATACGGA 


TGGCACA6AT 


GAAAAATATG 


19200 


GAGGCAGAGC 


ACTTAAACCT 


GAAACTAAAA 


TGAAACCCTG 


CTACGGGTCT 


TTTGCTAAAC 


19260 


CTACTAATGT 


TAAAGGCGGC 


CAAGCAAAAG 


TGAAGAAAGT 


AGAAGAAGGC 


AAGGTTGAAT 


19320 
ATGACATTGA


CATGAACTTT 


TTCGACCTAA 


GATCACAAAA 


GACTGGTCTC 


AAGCCTAAAA 


19380 


TT6TAATGTA 


TGCA6AAAAT 


GTGGATCTAG 


AAACTCCAGA 


CACTCATGTG 


GTGTACAAAC 


19440 


CTG6A6CTTC 


AGATGCTAGT TCTCATGCAA 


ACCTTGGTCA ACAGTCCATG 


CCCAATAGAC 


19500 


CTAACTATAT 


T6GCTTCA6G 


GACAACTTCA 


TOGGACTCAT 


GTACTATAAC 


AGTACTGGCA 


19560 


ACATGG6AGT 


GCTGGCTGGA CAAGCGTCTC 


AGCTAAATGC AGTGGTTGAC 


TTGCAAGACA 


19620 


6AAACACA6A 


ATTGTCATAT 


CAACTCTTGC 


TTGATTCTCT GGGAGACAGA 


ACCAGATATT 


19680 


TCA6CATGT6 


GAATCAAGCA 


6TGGATAGCT 


ATOACCCAGA 


TGTGOGTGTT 


ATTGAAAACC 


19740 


ATGGTGTGGA 


AGAT6AACTT 


CCCAACTATT 


GTTTTCCATT 


6GACGGTGTA 


GGTCCGCGAA 


19800 


CA6ACAGTTA 


CAAGGGAATT 


GAGACAAATG 


GTGACGAAAA 


CACTACTTGG 


AAAGATTTAG 


19860 


ATCCAAATGG 


CATAAGTGAA 


CTTGCTAAGG 


GAAATCCATT 


TGCCATGGAA 


ATCAACATCC 


19920 


AAGCTAATCT 


CTGGAGAAGT 


TTCCTTTATT 


CCAACGTGGC 


CCTCTATCTC 


CCAGACTCGT 


19980 


ACAAATACAC 


TCGAACCAAT 


GTTACTCTCC 


CAGAAAACAA AAACACCTAT 


GACTACATGA 


20040 


ATGGGCGG6T 


GGTTCCCCCC 


TCCCTGGTGG 


ATACCTACGT 


AAACATTGGC 


GCCAGATGGT 


20100 


CTTT6GATGC 


CAT6GACAAC 


GTCAACCCCT 


TCAACCATCA 


CCGCAACGCT 


GGCCTGCGAT 


20160 


ACCGGTCCAT 


GCTTCTOGGC 


AATGGTCGCT 


ACGTGCCTTT 


CCACATTCAA 


GTGCCTCAGA 


20220 


AATTCTTTGC 


TGTGAAAAAC 


CTGCTGCTTC 


TACCTGGTTC 


TTACACCTAC 


GAGTGGAACT 


20280 


TCAGAAA6GA 


TGTGAACATG 


GTCCTGCAGA 


GTTCCCTTGG 


CAATGATCTC 


CGAGTTGATG 


20340 


GCGCCAGCAT 


CAGTTTTACC 


AGCATCAATC 


TCTATGCCAC 


CTTCTTCCCC 


ATGGCCCACA 


20400 


ACACTGCCTC 


CACCCTTGAA 


GCCATGCTGC 


GCAACGACAC 


CAATGATCAA 


TCATTCAATG 


20460 


ACTACCTTTC 


TGCAGCTAAC 


ATGCTCTACC


CCATCCCTGC 


CAATGCTACC 


AACGTTCCCA 


20520 


TCTCCATTCC 


CTCTCGCAAC 


TGGGCCGCCT 


TCAGGGGCTG 


GTCCTTTACC 


AGACTGAAAA 


20580 


CCAAGGAGAC 


TCCCTCTTTG 


GGATCAGGGT 


TCGATCCCTA 


CTTTGTTTAC 


TCTGGTTCTA 


20640 


TACCCTACCT 


GGATGGTACC 


TTCTACCTCA 


ACCACACTTT 


CAAGAAAGTC 


TCTATCATGT 


20700 


TTGACTCTTC 


AGTCAGCTGG 


CCTGGTAATG 


AGAGATTGCT 


AACTCCAAAC 


GAGTTOGAAA 


20760 


TCAAGC6CAC 


AGTTGATGGG 


GAAGGCTACA 


ATGTGGCCXZA 


ATGTAACATG 


ACCAAAGACT 


20820 


GGTTTCT6GT 


CCAGATGCTT 


GCCAACTACA 


ACATTGGATA 


CCAGGGTTTC 


TATGTTCCTG 


20880 


AGGGTTACAA 


GGATCGCATG 


TATTCCTTCT 


TCAGAAACTT 


CCAGCCCATG 


AGTAGACAGG 


20940 


TGGTTGATGA 


GATTAACTAC 


AAAGACTATA 


AAGCTGTCGC 


CGTACCCTAC 


CAGCATAATA 


21000 


ACTCTGGCTT 


TGTGGGTTAC 


ATGGCTCCTA 


CCATGCGTCA 


GGGTCAAGCG 


TACCCTGCTA 


21060 


ACTACCCATA 


CCCCCTAATT 


GGAACCACTG 


CAGTAACCAG 


TGTCACCCAG 


AAAAAATTCC 


21120 


TGTGTGACAG 


GACCATGTGG 


CGCATCCCAT 


TCTCTAGCAA 


CXTCATGTCC 


ATGGGTGCCC 


21180 


TTACAGACCT 


GGGACAGAAC 


TTGCTGTACG 


CCAACTCAGC 


CCATGCX;CTG 


GACATGACTT 


21240 
TTGAG6T6GA


TCCCATGGAT 


GAGCCCACCC TGCTTTATCT TCTTTTCGAA 


GTATTOGACG 


21300 


TGGTCAGAGT 


GCACCAACCA 


CATCGCGGCG TCATCGAGGC CGTCTACCTG 


CGCACACCGT 


21360 


TCTCGGCTGG 


TAACGCCACC 


ACATAA6AAA CCTGCTTCTT GCAAGGGGCA 


GCCATGACCT 


21420 


GOGTGACCGG 


AAA06GCTCC 


AGCGAGCAAG AGCTCAGAGC CATCGTCOGA 


GACCTTGGCT 


21480 


GTGGACCCTA 


TTTTCTGGGA 


ACCTTTGACA AAOGCTTCCC GGG6TTTATG 


GCTCCAGACA 


21540 


AGCTGGCCTG 


CGCCATTGTC 


AACACAGCCG GTOGCGAGAC GGGGGGAGAG 


CACTGGXTGG 


21600 


CTTTTGGTTG 


GAACCCGCGC 


TCCAACACAT GCTACCTTTT TGATCCATTT 


GGATTCTCGG 


21660 


ATGACOGTCT 


TAAGCA6ATC 


TACCAGTTTG AATACGAGGG GTTGCTGCGC 


CGTAGCXSCCC 


21720 


TTGCTACTAA 


GGATCGCTGC 


ATTACCCTGG AAAAGTCCAC CCAAACAGTG 


CAGGGTCCGC 


21780 


GCTCCGCCGC 


TTGTGGACTT 


TTTTGCTGCA TGTTTCTCCA TGCCTTTGTA 


CACTGGCCAG 


21840 


ACCGTCCCAT 


GGACGGTAAC 


CCCACCATGA AGTTGCTTAC GGGAGTGCCC 


AACAACATGC 


21900 


TCCAGTCACC 


CCAAGTCCAG 


CCCACCCTGC GCAGGAACCA GGAGGOGCTC 


TACCATTTCC 


21960 


TGAACACACA 


TTCATCTTAC 


TTTCGTTCTC ACOGCGCACG TATCGAAAGG 


GCTACTGCGT 


22020 


TOGATOGTAT 


GGGATAATAT 


AAGTCATGTA AAACCGTGTT CAATAAACAG 


AACTTTATTT 


22080 


TTTACATACA 


CT6GTGGTTT 


6CTCATTTAT TOGCTCAGAA GTCGAAGGGG 


TTTTGGCGGG 


22140 


AATCAGAGTG 


ACCCGOGGGC 


AGGGATACGT TTOGGAACTG GAACTGAGCT 


TGCCACTTGA 


22200 


ATTCGGGGAT 


CACCAGCTTG 


GGAACTGGCA GGTCAGGCAG GATGTCGCTC 


CACAGCTTCC 


22260 


GGGTTAGTTG 


CAGGGCTCCC 


AACAGGTCAG GGGCTGAAAT CTTAAAATCG 


CAATTGGGAC 


22320 


CCGTGCTCTG 


AGCGCGGGAG 


TTGCGATACA CA6GGTTGCA ACACTGGAAC 


ACCATAAGC6 


22380 


ACGGGTATTT 


CACACTCGCC 


AGCACAGTGG GGTCGGTGAT AATTCCCACA 


TCCAGGTCTT 


22440 


CGGCATTGGC 


CATGCTAAAG 


GGGGTCATCT TGCAAGTCTG TCTGCCCATA 


GTCGGTACCC 


22500 


AGCCTGGCTT 


GTGGTTGCAA 


TCGCAGCGCA GAGGGATTAG CATCATCTTG 


GCCTGGTCGG 


22560 


ATCTCATACC 


TGGATACACA 


GCTTTCATGA AAGCTTCATA TTGTTTGAAA 


GCCTGTTGGG 


22620 


CCTTGCTACC 


CTCAGTGTAG 


AACATCCCAC ATGACTTGCT AGAGAACTGG 


TTGGCAGCAC 


22680 


ACC0G6CATC 


ATTCACACAA 


CAGCGAGCGT CGTTGTTGGC TATTTGTACC 


ACACTCCTGC 


22740 


CCCAGCGGTT 


CTGGGTGATC 


TTGGTTGGCT CAGGGTTCTG CTTCAGCGCC 


CGTTGACCGT 


22800 


TTTCGCTTGC 


CACATCCATT 


TCTATGATAT GCTCCTTGTG GATCATGATG 


TTGCCATGCA 


22860 


AAGACTTTAG 


CTTGCCTTCA 


TAATCATTAC ATCCATGTGA CCACAACGCG 


CATCCCGTAC 


22920 


ACTCCCAGTT 


ATTGT6AGCG 


ATCTCAGAAT AGGAGTGCAC CAACCCCTGC 


AGGAATCTTC 


22980 


CCATCATGGT 


TGAGAGGGTC 


TTGTTACTGG TGAAAGTCAA CGGGACGCCT 


CGATGCTCCT 


23040 


CATTCACATA 


CTGGTGGCAA 


ATTCGCTTGT ACTGTTCATG CTGCTCTGGC 


ATAAGCTTGA 


23100 


AAGAGGTTCT 


TAGGTCATTC 


TCCAGCCTGT ACTTCTCCAT CAGCACAGCC 


ATTACTTCCA 


23160 
TGCCCTTTTC


CCAGGCA6AA 


ACCAGGGGTA 


GGCTCATGGC 


ATTTCTAACA 


GAAATAGCAG 


23220 


CTACTTTA6C 


CAGAGGGTCA 


TCCTTGTCGA 


TCTTCTCAAC 


ACTTCTTTTG 


CCATCCTTCT 


23280 


CAGTGAT6CG 


CACGGGTGG6 


TAGCTGAAGC 


CCACAGCCAC 


CAGCTCOGCC 


TCTTCTCTTT 


23340 


CTTCTTCGCT 


GTCCTGACTG 


AT6TCTT0TA 


AAGGGACATG 


CTTGGTCTTC 


CTGGACTTCT 


23400 


TTTT6G60G6 


TATT6GCGGA 


GGGCTGCT6C 


TCCGCTCCGG 


AGACATGGAG 


GACCGCX^G 


23460 


TTTG6CTGAC 


CAGTACCACC 


TGGCTCTOGG 


TAGAA6AACC 


GGACCCGACA 


OGGCGGTAGG 


23520 


TGTTCCTCTT 


OGGGGGCAGA 


GGCGGA6GTG 


ACTGOGATGG 


GCTGCGGTCT 


GGCCTGGGAG 


23580 


GOGGATGACT 


GGCAGAGCCC 


CTTCCGCGTT 


CGGGGGTGTG 


CTCCOGGT6G 


CGGTOGCTTG 


23640 


ACTGATTTCC 


TCCGCGGCT6 


GCCATTGTGT 


TCTCCTAGGC 


AGAGAAAACA 


ACAGACATGG 


23700 


AGACTCAGCC 


ATCGCTGCCA 


ACACCGCTGC 


AAGCACCATC 


ACACCTCGCC 


TCCAGCGACG 


23760 


AG6AGGAGGA 


ACAAAGCTTA 


ACCGCCCCAC 


CACCCAGTCC 


CGCCACCACC 


ACCTCTACCC 


23820 


TCGAGGATGA 


GGAGGTCGAC 


GCACCCGAGG 


AGATACAGGC 


GCAGGATATG 


AAGGATGAGA 


23880 


AAGCGGAAGA 


6ATTGAGGCA 


GATATCGAGC 


AGGACCCAGG 


CTATGTGACA 


COGGCCGAGC 


23940 


AOGAGGAAGA 


GCTGAGACGC 


TTTCTA6A6A 


AAGATGATGA 


CAACOGTCCA 


GAACAGCAAG 


24000 


CAGAT6GC6A 


TCAACAGAAG 


GCTGGGCTCG 


GTGGTCATGT 


TGCOGACTAC 


CXCACCGGCC 


24060 


TTG6TGGGGA 


GGATGTGCTC 


CTCAAACACC 


TAGCAAGGCA 


GTCGATCATA 


ATCAAAGACG 


24120 


CACTGCTT6A 


TCGCAGCGAA 


GTGCXrCATCA 


GTGTGGAAGA 


GCTCAGCCGC 


6CCTAC6AGC 


24180 


TC3AATCTGTT 


CTCGCCTCGG 


GTACCCCCCA 


AGCGTCAGCC 


AAACGGCACC 


TGCGAGCCCA 


24240 


ACCCTCGCCT 


CAACTTCTAT 


CCCGCATTCA 


CCGTCCCCGA 


AGTGCTGGCC 


ACCTACCACA 


24300 


TATTTTTTAA 


AAACCAAAAA 


ATCCCAATTT 


CCTGCCG06C 


CAACCGAACT 


CGCGCCGATG 


24360 


CCCTGTTCAA 


CTTGGGACCT 


GGC6CTTGCT 


TACCTQATAT 


AACTTCCTTG 


GAAGAGGTCC 


24420 


CAAAGATCTT 


OGAAGGTCTG 


GGCAGTGATG 


AGACTCGGGC 


CGCAAATGCT 


CTGCAACAGG 


24480 


6A6AGAGT6G 


CATTGATGAA 


CATCACAGCG 


CTCTGGTGGA 


GTTGGA6GGC 


OATAATGCCC 


24540 


GACTTGCAGT 


ACTCAAGCGC 


AGTATCGAA6 


TGACCCATTT 


TGCATACCCC 


GCTGTCAACC 


24600 


TGCCTCCCAA 


AGTGATGAGC 


GCTGTCATGG 


ATCAGATACT 


CATTAAACGC 


GCAAGTCCCC 


24660 


TATCAGAAAA 


CATGCAGGAT 


CCAGACGCCT 


CGGATGAGGG 


CAAACCAGTG 


GTCAGTGATG 


24720 


AACAGCTATC 


TCGCTGGCTG 


GGCACCAACT 


CCCCACTAGA 


CTTGGAAGAG 


OGGCGCAAGC 


24780 


TCATGATGGC 


CGTGGTGCTA 


GTTACTGTGG 


AAATGGAGTG 


TCTTCGCCGC 


TTCTTCACTG 


24840 


ACCCCGAGAC 


ATTGCGCAAG 


CTCGAGGAGA 


ACCTGCACTA 


CACTTTTAGA 


CATGGATTTG 


24900 


TGCGACA6GC 


ATGCAAGATC 


TCCAACGTGG 


AGCTTACGAA 


CCTGGTTTCC 


TACATGGGCA 


24960 


TTTTGCATGA 


AAACAGACTC 


GGACAGAGCG 


TGTTGCACAC 


CACCCTGAAG 


GGTGAAGCCC 


25020 


GTCGCGACTA 


CATCCGCGAC ACTGTCTACC 


TCTACCTCTG 


CCATACCTGG 


CAGACTGGTA 


25080 
TGGGTGTGTG


GCAGCAGTGT 


TTGGAAGAAC 


AGAACCTGAA 


AGAGCTTGAC 


AAGCTCTTAC 


25140 


AAAGATCCCT 


CAAATCCTTG 


TGGACGGGTT 


TTGACGAGCG 


CACAGTCGCC 


TCTGATCTGG 


25200 


CAGATCTCAT 


CTTCXSCCGAG 


CGTCTCAGGA 


CCACTCTGCG 


CAACGGGCTG 


CCTGACTTCA 


25260 


T6AGCCAGAG 


CATGCTTAAC 


AACTTTC6CT 


CTTTCATCCT GGAACGCTCC 


GGTATCCTGC 


25320 


CCX3CCACCTG 


CTGTGCGCTA 


CCATCCGACT 


TTGTGCCTCT 


GAOCTACCGC 


GAGTGCCCAC 


25380 


CACCGCTATG 


GAGCCACTGC 


TACCTGTTCC 


GCCTGGCCAA 


CTACCTATCA 


TACCACTCGG 


25440 


ATGTGATCGA 


GGAT6TGAGC 


6GAGATGGCC 


TGCTTGAGT6 


CCACT6CCGC 


TOTAATCTCT 


25500 


GCTCACCACA 


TCGCTCCCTC 


GTCTGTAACC 


CCCAGCTGCT 


TAGTGAAACC 


CAAATTATCG 


25560 


GCACCTTCGA 


ATTGCAGGGT 


CCCAGCGGCG 


AAGGCGATGG 


GTCTTCTCCT 


GGGCAAAGTT 


25620 


TGAAACTGAC 


CCCGGGACTG 


TGGACCTCCG 


CCTACCTGCG 


CAAGTTCTCC 


CCCGAGGACT 


25680 


ACCACCCCTA 


TGAGATCAGG 


TTCTATGAGG 


ACCAATCACA 


GCCXSCCCAAA 


GCCGAGCTAT 


25740 


CAGCATGCGT 


CATCACCCAG 


GGGGCAATTT 


TGGCCCAATT 


GCAAGCCATC 


CAAAAATCCC 


25800 


GCCAAGAATT 


TTTGCTGAAA 


AAGGGTAACG 


GAGTCTACCT 


CGACCCCCAG 


ACTGGTGAGG 


25860 


AGCTCAACAC 


AAGGTTTTCT 


CAGGATGTCT 


CAGOGCOGAG 


GAAGCAA6AA 


GTTGAAA6TG 


25920 


CAGCTGCC6C 


CCCGAGAGOA 


TATGGAGGAA 


GACTG6GACA 


GTCAGAGAGA 


GGAGATGGAA 


25980 


GATTGGGACA 


GCCAGGCAGA 


GGAGGAGGAG 


GACAGCCTGG 


AGGAAGACAG 


TTTGGAGGAG 


26040 


6AAGACGAGG 


AGGCAGAGGA 


GGTGGAAGAA 


GCAACCGCCX3 


CCAAACAGTT 


GTCCTOGGCA 


26100 


GCGGAGACAA 


GCAAGGCCAC 


AGACAGTACC ACAGCTACCA TCTCCGCTCC 


GGGTCGGGGG 


26160 


GCCCAGCACC 


GTCCCAACAG 


TAGATGGGAT 


GAGACCGGGC 


GACTCCCGAA 


TGGGACCACC 


26220 


GCTTCTAAGA 


CTGGTAAGGA 


GCGGCAGGGA 


TACAAGTCCT 


GGCGGGGGCA 


TAAGAACGCT 


26280 


ATCATATCCT 


GCTTGCATGA 


ATGCGGGGGC 


AACATATCCT 


TCACCCGCCG 


CTACCTGCTC 


26340 


TTCCACCACG 


GGGTGAACTT 


CCCCCGCAAT 


GTCTTGCATT 


ACTACOGTCA 


CCTCCACAGC 


26400 


OCCTATTACA 


GCCCGCAA6T 


CTC6GCAGAA 


AAAGACAACA 


GCAGCAAGGA 


CCTCCAGCAG 


26460 


AAAACCAGCA 


GCAGTTAGAA 


AACCCACAGC 


AGGTGCA6GA 


GGACT6AGAA 


TCACAGCGAA 


26520 


CGAGCCAGCG 


CAGACCCGAG 


AGCTGAGAAA 


CCGGATTTTT 


CCAACGCTCT 


ATGCCATCTT 


26580 


CCAACAGA6T 


CGGGGGCAAG 


AGCAGGAACT 


GAAAGTAAAA 


AACCGATCTT 


TGCGCTCGCT 


26640 


CACC06AAGT 


TGTTTGTATC 


ACAAGAGC6A 


A6AGCAACTT 


CAGOGCACTC 


TCGAGGACGC 


26700 


06AGGCTCTC 


TTCAACAAGT 


ACT60G0GCT 


CACTCTTAAA 


GAGTAGCCCG 


CGCCCGCGCT 


26760 


ATCTCGAAAA 


AGGCGGGAAT 


TACGTCACCC 


TTGGCGCCCO 


TCCTTTGCCC 


TCGTCATGAG 


26820 


TAAAGAAATT 


CCCACGCCTT 


ACATGTGGAG 


TTATCAGCCC 


CAAATGGGAC 


TGGCAGCAGG 


26880 


CX5CCTCCCAG 


GACTACTCCA 


CCCGTATGAA 


TTGGCTCAGC 


GCCGGTCCCT 


CGATGATCTC 


26940 


ACGGGTTAAT 


GATATACGAG 


CTTATCGAAA 


CCAATTACTC 


CTAGAACAGT 


CAGCACTTAC 


27000 
CACCACCCCC


AGACAACACC 


TTAATCCCCG 


AAATTGGCCC 


GCCGCCCTGG 


TGTACCA6GA 


27060 


AACCCCC6CT 


CCCACCACOG 


TACTACTTCC 


TCGAGACX^CC 


CAGGCOGAAG 


TTCAGATGAC 


27120 


TAA0GCAG6T 


GTACAGCTGG 


06GG0GGTTC 


OGCXSCTTTGT 


OGTCACCGGC 


CTCAACAOAG 


27180 


TATAAAAG6C 


CTGOTGATCA 


GAG6CCGA6G 


TATCCAGCTC 


AACGAOGAGT 


CGGTGAGCTC 


27240 


TTCGCTT6GT 


CTGC6ACCAG 


ACXK3AGTCTT 


CCAAATTGCC 


GGCTGCGG6A 


GATCTTCCTT 


27300 


CACTCCTCGT 


GAGGCTGTAC 


TGACTTTGGA GAGTTCGTCC 


TCACAGCCCC 


GCTCG6GTGG 


27360 


CA70666ACT 


CTCCAGTTTG 


TGGAGGAGTT 


TACTCCCTCT 


GTCTACTTCA 


ACCCCTTCTC 


27420 


CGGATCTCCT 


GGGCATTACC 


CGGAOGAGTT 


CATACCAAAT 


TTCGACGCAA 


TCAGCGAGTC 


27480 


AGT66AT6GT 


TATGATTGAT 


GTCTAATGGT 


GGCGCGGCTG 


AGCTAGCTCG 


ACTGCGACAT 


27540 


CTAGACCACT 


GCCGCCGCTT 


TCGCTGCTTT 


GCCAGAGAAC 


TCACCGAGTT 


CATCTACTTC 


27600 


6AAATACCCG 


A6GAGCACCC 


TCAGGGACC6 


GCCCACGGAG 


TGCGTATTAC 


CATCGAA6GG 


27660 


GGTATAGACT 


CT06CCTGCA 


XCGAATCTTC 


TGCCAGC6GC 


COGTGCTAAT 


CGAGCGCGAC 


27720 


CAGGGAAACA 


CCACAGTCTC 


CATCTACTGC 


ATCTGTAACC 


ACCCCGGATT 


GCATGAAAGC 


27780 


CTTT6CTGTC 


TTATTTGTGC 


TGAGTTTAAT AAAAACTGAG 


TTAAGACTCT 


CCTACGGACT 


27840 


ACCAATTCTT 


CAACCCGGAC 


TTTATAACAA 


TCAGACCXITC 


CTACCAAGTC 


AGAAGACCCC 


27900 


AACCCTTCCT 


CTGATCCAGG 


ACTCTAATTC 


TACCTCCCCA 


GCACCATACT 


TTACTAGCCT 


27960 


TCC06AAACT 


AACAACCTCG 


6AGCTAAACT 


GCACC6CTTT 


TCCAGAAGCC 


TTCTCTCTGC 


28020 


CAATACTACC 


ACTCCCAGAA 


CCGGAGGTGA 


GCTCCGTAGT 


CTTCCTAATA 


ACAACCCCTG 


28080 


GGTGGTAACT 


GGGTTTGTAA 


CATTAGGTGT 


AGTTGOGGGT 


GGGCTTGTGC 


TTATCCTTTG 


28140 


CTACCTATAC 


ACACCTTGCT 


6TGCTTATTT 


AGTAATCTTG 


TGTTGCTGGT 


TTAAGAAATG 


28200 


GGGGCCCTAC 


TAGCCGCGCT 


TGCTTTACTT 


TCACTTTTTG 


AGCCTGGCTC 


TACTATGCTA 


28260 


GTTCAGCCTG 


TACTATTTGA 


TCCATGCCTC 


AATTTTGATC 


CAGACAACTG 


CACACTCACT 


28320 


TTTGCTCCAG 


AGGCTGGACG 


CTGTGGAGTT 


CTTATTAGGT 


GCGGACGGGA 


ATGCAGTCCC 


28380 


ATTGAAATAC 


ACCACAATAA 


CAAACTTTGG


AACAATACCT


TATTCACGAC 


ATGGGAGCCA 


28440 


GGAGACCXTTG 


AGTGGTATAC 


TGTCTCTGTC 


CGTGGTCCTG 


AOGGTTCCAT 


CCGCACTGCT 


28500 


AATAACACTT 


TTATTTTTGC 


TGAGAT6TGC 


GATCTGACCA 


TGTTCATGAG 


CAAACAGTAT 


28560 


AACCTATGGC 


CTCCAA6CAA 


GGA6AACATT 


GTGGCATTCT 


CCCTTGCTTA 


TTGCTTGTGT 


28620 


ACGTGTCTCA 


TTACTGCTAT 


TCTGTGTATC 


TGCATACACT 


T6CTTATTGC 


CACTCGCCAC 


28680 


AGAAACAGCA 


ATAAGGAAAA 


AGAGAAAATG 


CCTTGAGCTT 


TTTCTCATCT 


ATGTTTTTTT 


28740 


TTTTTGTTAC 


AGACATGGCT 


TCAGTTATAG 


CTCTAATTAT 


TGCCAGCATT 


CTCACTGCCG 


28800 


CACACGGACA 


AACAATTGTC 


TATATTACCT 


TAGGTCATAA 


CCACACTCTT 


ATAGGACCCC 


28860 


AAATTAGTTC 


ACAGGTTATA 


TGGACCAAAC 


TTGGAAGTGT 


TGATTATTTT 


GACATAATCT 


28920 
GCAACAGAAC


TAAACCAATA 


TTTGTAACCT 


GTAACAAACA 


AAATCTCACC 


TTAATCAATG 


28980 


TTAGCGAAAT 


TTACAACGGT 


TACTATTATG 


GTTATGACAG 


ACACA6CAGT 


GAATATAAAA 


29040 


ATTACTTAGT 


TCGCATAACT 


CAACCCAAAA CTACAAAAAT 


GCCAAATATG 


GCAAAAA7TC 


29100 


AAATGGTTAG 


GACATTAGAA 


AATCTTTCAT ATCCCACCAC ACC06ATGAG 


AAAAACATTC 


29160 


CAAATTCAAT 


GATTGCCATT 


ATTGCGGCGG 


TG6CAGTGGG 


AATGGCACTA 


ATAATAATTT 


29220 


6TAT6TTCCT 


ATATGCTT6T 


TACTGTAGAA 


AGTTTCACAA 


ACAG6ACCCC 


CTACTAAATT 


29280 


TTTGACATTT 


AATTTTTTAT 


ACAGCTATGG 


TTTCCACTAC 


AGCCTTTTTT 


ATTATCAGTA 


29340 


GCCTTGCAGC 


TGTCACTTAT 


GGTOGCTCAC 


ACCTCACTGT 


AACTGTTGGC 


TCAACTTGTA 


29400 


CACTACAAGG 


ACCCCAAGAA 


GGGCAT6TGA 


GTTGGTGGAG 


AATATATGAT 


AGTGGATGGT 


29460 


TCATTAGGCC 


ATGTGACCAG 


CCTGGTAACA 


AATTTCTCTG 


CAACGGGAGA 


GACCTGACCA 


29520 


TTATTAACAT 


AACA6TAAAT 


GACCAGGGCT 


TCTATTATGG 


AACTAACTAT 


AAAAATAACT 


29580 


TAGATTACAA 


CATTATCGTA 


GTGCCAGCCA 


CCACTCCAGC 


TCCCCGCAAA 


ACCACTTTCT 


29640 


TTAGCAGCAG 


TGCCAGTATT 


TCTAAAACAG 


CTTCTGCAAT 


CTTAAAGCTT 


CAAAAAATCG 


29700 


CTTTAAGTAA 


TTCCACAACC 


TCTTCCACTA ACACAACXITC 


TAAATCAGTA 


GTCGGCATCG 


29760 


CT6TTGC0GC 


GGTAATGGGA 


TTAAT6ATTA 


TAACTTTGTG 


CATAATCTAC 


TACGCCT6CT 


29820 


6CTATAGAAA 


ACATGAACAA 


AAAAGC6ATC 


CCTTGCTGAA 


7TTT6ATATT 


TAATTTTTTT 


29880 


TTATAGAATC 


ATGAAAAAAC 


TAATTATCCT 


AGCTTTTATT 


TTGTTTCAAT 


CATATAOCAC 


29940 


TAACACTACC 


AATGTGCAGA 


CTACTTTAAA 


TCATAGTATG 


GAAAACCACA 


CTACCTCTTA 


30000 


TAAGCACACA 


AACATCACTA 


CCCATCAGCC TAAATATGCT AT6CAACTAG 


AAATCACAAT 


30060 


ACTAATTGTG 


ATTGCAATAC 


TTATCATATC 


TATCATTTTC 


TATTTTACCC 


TATGCCGCCA 


30120 


AATACCCAAT 


ATTCATAGAA 


AAAGACGTCC 


CATTTATTGC 


CCCATGATTA 


GTCAACCCCA 


30180 


TATGACTCTA 


AATGAAATCT 


AAGATCTATT 


CTTTCTCTTT 


TTTACAGTAT 


GGTGAACACC 


30240 


AATCATGATT 


CCTAGAAATT 


TCTTCTTCAC 


CATACTCATC 


TGTGCTTTTA 


ATGTCTGTGC 


30300 


CACCTTTACA 


GCAGTAGCGA 


CTACAAGCCC 


CGACTGTATA 


GGACCATTTG 


CCTCATACAC 


30360 


ACTTTTTGCT 


TTTGTCGCTT 


GCACCTCCGT 


GT6TAGCGTA 


GTCTGCCTGG 


TTATTAATTT 


30420 


TTTTCAACTT 


GTAGACTGGA 


TCTTTGT6AG 


ACTTGCCTAT 


CTGCGTCACC 


ATCCCGAATA 


30480 


CCGCAATCAA 


CATGTTGCGG 


CACTTCTCAG 


ACTTATTTAA AACGAT6CAG 


GCTATACTAC 


30540 


CAGTCATTCT 


6CTTCTGTTG 


CTCCCCTGCG 


ATGOCTTAAC 


CCCC6TCGCT 


AATCGTACCC 


30600 


CACCTGAACA 


ACTTAGAAAA 


TGCAAATTCC 


AACAACCATG 


GACATTCCTT 


GATTGCTATC 


30660 


6AGAAAAATC 


TGATTTCCCC 


ACATACTGGA 


TTATGATCAT 


TGGAATTGTT 


AATCTAGTTT 


30720 


CTTGCACACT 


ATTCTCTTTC 


CTTGTTTATC 


ATTTTTTTGA 


TTTTGGATGG 


AATGCCCCCA 


30780 


ATGCACTCAC 


TTACCCACAA 


GAACCAGAGG 


AACATATCCC 


ACTACAGAAC 


ATGCAACAGC 


30840 
CAATAOCTTT


AATAGATTAT 


GACAATGAGC 


CACAGCCCTC 


GCTGCTTCCT 


GCTATTAGTT 


30900 


ACTTCAACCT 


AACCGGTGGA 


GATGACTGAC 


CCACTCGCCG 


CCTCCACTGC 


TGCCGAGGAA 


30960 


CT6CTT6ATA 


TGGACGGCCG 


CACCTCA6AA 


CAGCGACTCG 


CCCAACTACG 


GATACX3CCAG 


31020 


CA6GAG6AAC 


GTGCCGCCAA GGAGCTCAGG 


6ATGCTATTG 


AAATTCACCA 


GTGCAAAAAA 


31080 


GGCATATTCT 


GTCTGGTGAA 


ACAAGCCAAG 


ATTTCCTAOG 


AGATCACCAC 


TACTGACCAT 


31140 


C6CCTCTGAT 


AC6A6CTCGG 


TCCGCAGCGG 


CAAAAATTCA 


CGTGTATGGT 


GGGAATCAAC 


31200 


CCCATA6TCA 


TTACCCAGCA 


GGCTGGAGAT 


ACTAA6GGTT 


GCATOCACTG 


TTCCTGCGGT 


31260 


TCCACCGA6T 


6CATCTACAC 


CCTACTTAAG 


ACCCTCTGOG 


GCCTTC6AGA 


CATCCTACCC 


31320 


ATGAACTGAT 


CAACTTTCCT 


TCCCCCCSATT 


CAAAAAACAA 


TAATAAAAAT 


CACTTACTT6 


31380 


AAATCAGCAA 


TCATGTCTCC 


GTCCAAATTT 


TCTCCTAGCA 


6CACCTCACT 


TCCCTCTTCC 


31440 


CAACTCTGGT 


ACTCTAAACC 


CCGCCTGGCA 


GCATACTTTC 


TCCACACTTT 


AAATGGAATG 


31500 


TCAAATTTTA 


GTTCCTCTTT 


TCTACCCACA 


ATCTTCATCT 


CTTTATTCTC 


CCCAGATGGC 


31560 


GAAAC6AACT 


CGGTTGAGCA 


GCTCCTTCAA 


CCCGGTCTAC 


CCCTATGAAG 


AT6AAAACAG 


31620 


CTCACACCCC 


TTTATAAACC 


CTGGTTTCAT 


TTCCCXTAAT 


6GGTTTACAC 


AAAGCCCAGA 


31680 


C6GAGTTCT6 


ACACTAAATT 


GTGTTGCTCC 


CCTTACAACC 


GCTAATGGCG 


CCCTAGATAT 


31740 


CAAAGTAG6A 


GGAGGGCTTA 


AA6T6AACTC 


AACTGATGGA 


TTCTTAGAAG 


AAAACATAAA 


31800 


CATCACATCA 


CCACTTACAA 


AGTCTAACCA 


TTCTATA6GT 


TTAGAATGGA 


GCGATGGGTT 


31860 


ACAAACAAAC 


GAAGCCAAGC 


TCTGTGTCAA 


ACTTGGAAAA 


GGTCTTGTAT 


TTGACTCTTC 


31920 


CAGTGCTATT 


GCAATGGAAA 


ATAACACTTT 


6TGGACA6GT 


GCAAAACCAA 


GTGCCAACTG 


31980 


TGTAATTAAA 


GAGGGAGAAG 


ATTCCCCAGA 


CTGTAAGCTC 


ACTTTAGTTC 


TAGTGAAGAA 


32040 


TGGAGGACTG 


GTAAATGGAT 


ACATAACATT 


AATGGGAGAC 


TCAGAATATA 


CTAACACCTT 


32100 


GTTTAAAAAC 


AAACAAGTTA 


CAATAGATGT 


AAACCTCGCA 


TTTGATAATA 


CCGGCCAAAT 


32160 


TATCACTTAC 


CTATCATCTC 


TTAAAAGTAA 


CCTGAACTTT 


AAAGACAACC 


AAAACATGGC 


32220 


TACTGGAACC 


ATAACCA6TG 


CCAAA6GCTT 


CAT6CCGA6C 


ACCACOGCCT 


ATCCATTTAT 


32280 


AACATAOGCC 


ACTCAGTCCC 


TAAAT6AAGA 


TTACATTTAT 


GGAGAGTGTT 


ACTACAAATC 


32340 


TACCAATGGA 


ACTCTCTTTC 


CACTAAAAGT 


TACTGTCACA 


CTAAACA6AC 


GTATGTCAGC 


32400 


TTCTGGAATG 


GCCTATGCTA 


TGAACTTTTC 


ATGGTCTCTA 


AATGCAGAGG 


AAGCCCCTGA 


32460 


AACTACOGAA 


GTCACTCTCA 


TTACCTCCCC 


CTTCTTTTTT 


TCTTATATCA 


GAGAAGA06A 


32520 


CTGACAACAA 


AAAATAAAGA 


TTAACTTTTT 


TATTGAAATC 


AGTTTACAAG 


ATTCGAGTAG 


32580 


TTATTTTGCC 


CCCCTCTTCC 


CATTTTATAG 


AATACACAAT 


CCTCTCCC?CA 


CGCACAGCTT 


32640 


TGAACATTTG 


AATTCCATTA 


GAGATAGACA 


TAGTTTTAGA 


TTCCACATTC 


CACACAGTTT 


32700 


CAGAGCG6GC 


CAATCTTGGA 


TCAGTGATAG 


ATATAAAGCC 


ATCGGAACAG 


TCTTTCAAGG 


32760 
TGGTTTCACA


GTCCAACTGC 


TGCGGCTGCG 


GCTCCGGAGT 


TT6GATTAGA 


GTCATCTGGA 


32820 


A6AAGAAC6A 


TGGGAGTCAT 


AATCCGAGAA 


CGGGATCGGA 


CGGTTGTGTC 


TCAAACCTCX^ 


32880


AAGCA6TC6C 


TGTCTGCGCC GCTCCGT60G 


ACTGCTGCTG 


ATGGGATCAG 


GATCCACAGT 


32940 


CTCTCTAAGC 


ATGATTTTAA 


TA6CCCTCAA 


CATTAACATC 


CTGGTGCGAT 


GTGCACAACA 


33000 


AC6CATTCTA 


ATCTCGCTTA 6CTCACTGCA 


GTAGGTACAA 


CACATTACCA 


CAATGTTGTT 


33060 


TAACAGGCCA 


TAATTAAAGG 


TGCTCCAGCC 


AAAACTGATC 


TCAGGGATAA 


TCATGCCOGC 


33120 


GT6ACCATCA 


TACCAGATCT 


TAAT6TAAAT 


CAAATGGCGC 


cx:ccTa::AGA 


ACACACTGCC 


33180 


CACATACATA 


ATCTCCTTGG 


6CATATGCAT 


GTTCACAATC 


TCTCTGTACC 


ATGGACAGC» 


33240 


CTG6TTAATC 


ATACAGCCCC TAATAACCTT 


CCG6AACCAA ATAGCCAGCA 


CTGCTCCCCC 


33300 


AGCAATACAT 


TGAAGAGAAC 


CCGGCTGTTT 


ACAGTGACAA 


TGAAGAACCC 


ACTTCTCTCG 


33360 


CCCATGGATC 


ACTTGAGAAT 


GAAATATATC 


TATAGTGGCA 


CAACACAAAC 


ATAAATGCAT 


33420 


GCATCTTTTC 


ATAACCCTTA 


ACTCTTCGGG 


GGTTAGAAAC 


ATATCCCAGG 


GAATGGGAAG 


33480 


CTCTTGCAAA 


ACAGTAAAGC 


TGGCAGAACA 


AGGAAGACCG 


CGAACATAAC 


TTACACTGTG 


33540 


CATGGTCAGG 


GTATTACAAT CT6GTAACAG 


TGGATGGTCT 


TCAGTCATAG 


AAGCTCTGGT 


33600 


TTCATTTTCC 


TCACAGOGTG 


GTAAAGGGGC 


CCTCAAATGA GGGTCCATGA 


TGTACGGATG 


33660 


ATGTCTGTGG 


CAT6ACGTCG 


ATCGTGCACG 


CGACCTCGTT 


GTAATGGAGC 


TGCTTCCTGA 


33720 


CATTCTCGTA 


TTTTGCATGA 


CAAAACCTAG 


CCTTA6CACA 


ACACACT7CT 


CTTCGCCTTC 


33780 


TATCCCGTCX3 


CCTAACGCAT 


TCAGTGTGGT 


AATT6AAGTA 


CAGCCATTCC 


OGTAGATTGG 


33840 


TCAAAAGTTC 


CTCGGCTTCA 


GTTGTTATGA 


AAACTCCATC 


ATGTCTGATC 


GCTCTGATAA 


33900 


AATCATTCAC 


TGTAGAATGG 


GCAATACCCA 


ACCATGCAAT 


ACAATTAGCT 


TGAGTTTTAA 


33960 


TCAAAGGAGG 


GGGAGGAAGA 


CATGGAAGAA 


cx;ataattaa 


TTTTTTATTC 


CAGACGATCT 


34020 


CGCAGTATTT 


CTAAATGAAG 


ATCACGAAGA 


TGGCACCTCT 


CGCCCCCACT 


GTGTTGATGA 


34080 


AAAATAACAG 


CTAAGTCAAA 


CACGATGCGA 


TTCTCAAGAT 


GCTCAATGGT 


GGCTTCAAGC 


34140 


AAAGCCTCCA 


CGCGCACATC 


CAAAAACAAA 


AGAACAGCAA 


AAGAAGGGGC 


ATGTTCTAAT 


34200 


TCCTCAATCA 


TCATATTACA 


TTOCTGTACC 


ATTCCCAGAT 


AATTTTCATC 


TTTCCAGCCT 


34260 


TGAATTAATC 


GTGTCATTTC 


TTCTTGTAAA 


TCCAATCCAC 


ACATGAAAAA 


CAGCTCTC6G 


34320 


AGGGCACCCT 


CCACCACGAT 


CCTTAAGCAC 


ACCCTCATAA 


T6ACAAAATA 


TCTT6CTCCT 


34380 


6T6TCACCTG 


CAGCAAATTG 


AGAATGGCAA 


CATCAAACGA 


CATGCCATTG 


TCTCTAAGCT 


34440 


CTTCTCTAAG 


TTCAAGTTGT 


AAAAACTCCT 


TCAAATCATC 


GCCAAACTGC 


TTGGCCATAG 


34500 


GTCCGCCA6G 


AATAAGAGCG 


GGGGAOGCTA 


CTGTACAGAA 


CAAACGGAGA 


CCGCCCCAAT 


34560 


GGGATCCAGC 


AAAAGTGAGG 


TTACAATAAG 


CATACTGAGA 


ACCTCCAGTG 


ATATCATCCA 


34620 


GAGTGCTGGA 


AACATAATCA 


GGCAGAGTTT 


CTCGTATAAA 


ATTAATAAAA 


GAAAATTCTG 


34680 
CGA6AT6AAC


ATTTAAAATT 


TCTGGAATAC 


AGATGCAATA 


AGTTACCGCG 


CTGCGCTCCA 


34740 


ACIiTTGTTAG 


TAOGATTAGT 


CTGTAAAAAA 


AGAGCACAAA AGTTATTACA 


TCATGCTAGC 


34800 


CTGGGGAAOO 


GATGGATAAA 


TCACTCTCTC 


GAAGACCAGG 


CAGGCTACAG 


6GTCTCCAAC 


34860 


ACGACCCTCiG 


TAAAACCTGT 


CAGTATGATT 


AAAAAGCATC 


ACOGAAAGAG 


GCTGTT6ATG 


34920 


AGCAGGAAAT 


ATTATTTOCG 


ATGAAGCATA 


CAATCCAGAA 


GTGTTAGTAT 


CA6TXAAAGA 


34980 


AAAAAAA06T 


CCAATATAGC 


ATCTGGGAAC 


AATTAT6CTC 


AATCTCAAAT 


GCAGCAAAGC 


35040 


6ACACCTCTG 


GGAT6CAAAG 


TAAAATCCAC 


AGGAGCATAA 


AAAATGTAAT 


TATTCCCCTC 


35100 


TTGCACAGGC 


AGCCTAGCTC 


CCXK5CCCCTC 


CAAAATCACA 


TACAAAACTT 


CAGCCATA6C 


35160 


TTACCGCACA 


AATCAGGCA6 


AGCAGACAGG 


AGAACTATAA 


ACTGACTGCC 


GCCTGTGCGC 


35220 


AATATATAGT 


CAACCTATAC 


ACTGACGTAA 


TCGGATAAAG 


TCTAAAAAAT 


CCCGCCAAAA 


35280 


CCAGCACACG 


CCCAGAAACT 


GTGTCATCCG 


CGAGAAAATT 


TCACTTCCGC 


ATTTTATTCC 


35340 


GGAAAAACGT 


CACTTCCTCT 


TTCCCACGAA 


TCGTCACTTC 


CGGTAATCTT 


GTAACGTGAC 


35400 


CTTCCCGCCC 


CGCCCCTAAC 


GGTCGCCGTC 


CCCACAGCCA 


ATCACCTTTC 


ACCCTCCCCA 


35460 


AATTCAAACX3 


CCTCATTT6C 


ATATTAACAC 


GCACCAAAAG 


TTTAAGGTAT 


ATTATTGATG 


35520 


ATOG 












35524 



(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 36519 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

CCATCTTCAA TAATATACCT CAAACTTTTT GTGCGCGTTA ATATGCAAAT GAGGCGTTTG 60 

AATTTGGGGA GGAAGGGCGG TGATTGGTCG AGGGATGAGC 6ACCGTTAGG GGCGGGGCGA 120 

GTGACGTTTT GATGACGTGG TTGCGAGGAG GAGCCAGTTT GCAAGTTCTC GTGGGAAAAG 180 

TGAOGTCAAA OGAGGTGTGG TTTGAACAGG GAAATACTCA ATTTTCCCGC GCTCTCTGAC 240 

AGGAAATGAG GTGTTTCTGG GCGGATGCAA GTGAAAACGG GCCATTTTCG C6CGAAAACT 300 

GAATGAG6AA GTGAAAATCT GAGTAATTTC GOGTTTATGG CAGGGAGGAG TATTTGCCGA 360 

GGGCCGAGTA GACTTTGACC GATTACGTGG GGGTTTCGAT TACCGTGTTT TTCACCTAAA 420 

TTTCCGCGTA CGGTGTCAAA GTCCGGTGTT TTTACGTAGG TGTCAGCTGA TCGCCAGGGT 480 

ATTTAAACCT GOGCTCTCCA GTCAAGAGGC CACTCTTGAG TGCCAGCGAG AAGAGTTTTC 540 

TCCTCCGCGC CGCGAGTCAG ATCTACACTT TGAAAGATGA GGCACCTGAG AGACCTGCCC 600 
6AT6AGAAAA


TCATCATC6C 


TTCCGGGAAC GAGATTCTGG 


AACTGGTGGT 


AAATGCCATG 


660 


ATGGGC6AC6 


ACCCTCCGGA 


GCCCCCCACC 


CCATTTGAGA 


CACCTTCGCT 


GCACGATTTG 


720 


TAT6ATCTGG 


AGGTGGATGT 


GCCCGAGGAC 


GATCCCAATG 


AGGAGGCXK^T 


AAATGATTTT 


780 


TTTAG06AT6 


CC60GCTGCT 


AGCTGCOGAG 


GAGGCTTCGA 


GCTCTAGCTC 


AGACAGCGAC 


840 


TCTTCACTGC 


ATACCCCTA6 


ACC06GCAGA GGTGAGAAAA 


AGATCCCCGA 


GCTTAAAGGG 


900 


6AAGAGATGG 


ACTTGCGCTG 


CTATGAGGAA TGCTT6CCCC 


CGAGGGATGA 


TGAGGACGAG 


960 


GAG6C6ATCC 


AGAACGGAGC 


GAGOCAGGGA GTGCAAGCCG 


CCAGOGAGAG 


CTTTGCGCTG 


1020 


GACTGCCC6C 


CTCTGCCCGG 


ACAOGGCTGT AAGTCTTGTG 


AATTTCATCG 


GATGAATACT 


1080 


GGAGATAAA6 


CTGT6TTGTG 


TGCACTTTGC TATATGAGAG 


CTTACAACCA 


TTGTGTTTAC 


1140 


AGTAAGTGTG 


ATTAAGTTGA 


ACTTTAGAGG 


GAGGCAGAGA 


GCAGGGTGAC 


T6GGCGATGA 


1200 


CTGGTTTATT 


TATGTATATA 


TGTTCTTTAT 


ATAGGTCCCG 


TCTCTGACGC 


AGATGATGAG 


1260 


ACCCCCACTA 


CAAAGTCCAC 


TTCGTCACCC 


CCAGAAATTG 


GCACATCTCC 


ACCTGAGAAT 


1320 


ATTGTTAGAC 


CAGTTCCTGT 


TAGAGCCACT 


GGGAGGAGAG 


CAGCTGTGGA 


ATGTTTGGAT 


1380 


6ACTTGCTAC 


AGGGTGGGGT 


T6AACCTTTG 


GACTTGTGTA 


CCCGGAAACG 


CCCCAGGCAC 


1440 


TAAGTGCGAC 


ACAT6TGTGT 


TTACTTGAGG 


TGATGTCAGT 


ATTTATAGGG 


TGTGGAGTGC 


1500 


AATAAAAAAT 


6TGTTGACTT 


TAAGTGOGTG 


GTTTATGACT 


CAGGGGTGGG 


GACTGTGAGT 


1560 


ATATAAGCAG 


GTGCAGACCT 


GTGTGGTTAG 


CTCAGAGC6G 


CATGGAGATT 


TGGACGGTCT 


1620 


TG6AAGACTT 


TCACAAGACT 


AGACAGCTGC 


TAGAGAACGC 


CTCGAACGGA 


GTCTCTTACC 


1680 


TGTGGA6ATT 


CTGCTTCGGT 


GGCGACCTAG 


CTAGGCTAGT 


CTACAGGGCC 


AAACAGGATT 


1740 


ATAGTGAACA 


ATTTGAGGTT 


ATTTTGAGAG 


A6TGTTCTGG 


TCTTTTTGAC 


GCTCTTAACT 


1800 


TGGGCCATCA 


GTCTCACTTT 


AACCAGAGGA 


TTTCGAGAGC 


CCTTGATTTT 


ACTACTCCTG 


1860 


GCAGAACCAC 


TGCAGCAGTA 


GCCTTTTTTG 


CTTTTATTCT 


TGACAAATGG 


AGTCAAGAAA 


1920 


CCCATTTCAG 


CAGGGATTAC 


CAGCTGGATT 


TCTTA6CAGT 


AGCTTTGTGG 


AGAACATGGA 


1980 


AGTGCCAGCG 


CCTGAATGCA 


ATCTCCGGCT 


ACTTGCCGGT 


ACAGCCGCTA 


GACACTCTGA 


2040 


6GATCCTGAA 


TCTCCAGGAG 


AGTCCCAGGG 


CACGCCAACG 


TCGCCAGCAG 


CAGGAGCAGG 


2100 


A66AGGATCA 


AGAAGAGAAC 


CCGAGAGCCG GCCTGGACCC 


TCCGGCGGAG 


GAGGAGGAGT 


2160 


AGCTGACCT6 


TTTCCTGAAC 


TGCGCCGGGT 


GCTGACTAGG 


TCTTCGAGTG 


GTCX^AGAG 


2220 


GGGGATTAA6 


CGGGAGAGGC 


ATGATGAGAC 


TAATCACAGA 


ACTGAACTGA 


CTGTGGGTCT 


2280 


GATGAGTCGC 


AAGCGCCCAG 


AAACAGTGTG 


GT6GCATGAG 


GTGCAGTCGA 


CTGGCACAGA 


2340 


TGA6GTOT0G 


GTGATGCAT6 


AGAGGTTTTC 


TCTAGAACAA 


GTCAAGACTT 


GTTGGTTAGA 


2400 


GCCSTGAGGAT 


GATTGGGAGG 


TAGCCATCAG 


GAATTATGCC 


AAGCTGGCTC 


TGAGGCCAGA 


2460 


CAAGAAGTAC 


AAGATTACTA 


AGCTGATAAA 


TATCAGAAAT 


GCCTGCTACA 


TCTCAGGGAA 


2520 
TQG6GCTGAA GTGGAGATCT GTCTCCAGGA AAGGGTGGCT TTCAGATGCT GCATGATGAA 2580

TAT6TACCCG GGAGTGGTGG GCATGGATGG GGTTACCTTT ATGAACATGA GGTTCAGGGG 2640 

AGATGGGTAT AATG6CAC6G TCTOTATGGC CAATACCAAG CTGACAGTCC ATGGCTGCTC 2700 

CTTCTTTGGG TTTAATAACA CCTGCATCGA GGCCTGG66T CAG6TCGGTG TGAGGGGCTG 2760 

GA6TTTTTCA GCCAACTG6A TGGGGGTOGT GGGCAGGACC AAOAGTATGC TGTCCGTOAA 2820 

GAAATGCTTG TTTGA6AGGT GCCACGTGGG G6TGATGAGC GAGGGCGAAG CCAGAATC06 2880 

CCACTGCGCC TCTACCGA6A OGGGCT6CTT TGTGCTGTGC AA6G6CAATG CTAAGATCAA 2940 

GCATAATATG ATCTGTGGAG CCTCSGGAC9GA 6CGCG0CTAC CA6ATGCTGA CCTGCXSCCGG 3000 

CGGGAACAGC CATATGCTGG CGACCGTACA TGTG6CTTCC CATGCTCGCA AGCCCTGGCC 3060 

CGAGTTCGAG CACAATGTCA TGACCAGGTG CAATATGCAT CTGGGGTCCC GCCGAGGCAT 3120 

GTTCATGCCC TACCAGTGCA ACCTGAATTA TGTGAAGGTG CTGCTGGAGC CCGATGCCAT 3180 

GTCCAGAGTG AGCCTGACGG GGGTGTTT6A CATGAATGTG GAGGTGTGGA AGATTCTGAG 3240 

ATAT6ATGAA TCCAAGACCA GGTGCC6A6C CTGCGAGTGC GGAGGGAAGC ATGCCAGGTT 3300 

CCA6CC06TG TGTGTG6ATG TGACGGA6GA CCTGCGACCC GATCATTT6G XGTTGCCCTG 3360 

GACCGGGAC6 GAGTTCGGTT 0CAGC6GG6A AGAATCT6AC TAGAGTGAGT AGTGTTCT6G 3420 

GGC6G6GGAG GACCTGCATG AG6GCCAGAA TAACTGAAAT CTGTGCTTTT CT6TGTGTTG 3480 

CAGCAGCATG AGCGGAAGCXy GCTCCTTTGA GGGA6GGGTA TTCAGCCCTT ATCTGACGGG 3540 

GCGTCTCCCC TCCTGGGOGG GAGTGOGTCA GAATGTGATG GGATCCACGG TGGAC6GCCG 3600 

GCCCGTGCAG CCCGCGAACT CTTCAACCCT GACCTATGCA ACCCTGAGCT CTTCGTOGTT 3660 

GGACGCAGCT GCCGCCGCAG CTGCTGCATC TGCCGCCAGC GCCGTGCX3CX; GAATGGCCAT 3720 

GGGCGCCGGC TACTACXfGCA CTCTGGTGGC CAACTCGAGT TCCACCAATA ATCCCGCCAG 3780 

CCTGAACGAG GAGAAGCTGT TGCTGCTGAT GGCCCAGCTC GAGGCCTTGA CCCAGCGCCT 3840 

GGGCXSAGCTG ACCCAGCAGG TG6CTCAGCT GCAGGAGCAG ACGCGG6CCG CG6TTGCCAC 3900 

66TGAAATCC AAATAAAAAA TGAATCAATA AATAAAOGGA 6ACGGTTGTT GATTTTAACA 3960 

CAGAGTCTGA ATCTTTATTT GATTTTTCGC GOGCGGTAGG CCCTGGACCA CC6GTCTCGA 4020

TCATTGA6CA CCOGGTG6AT CTTTTCCAGG ACCCGGTAGA 6GTGGGCTTG GAT6TTGAGG 4080 

TACATGGGCA T6A6CCCGTC CCGGG6GTGG AGGTAGCTCC ATTGCAGGGC CTC6TGCTCG 4140 

GGGGTGGTGT TGTAAATCAC CCA6TCATAG CAGGGGOGCA GGGCATGGTG TTGCACAATA 4200 

TCTTTGAGGA GGAGACTGAT GGCCACGGGC AGCCCTTTGG TGTAGGTGTT TACAAATCTG 4260 

TTGAGCTGGG AGGGATGCAT GCGGGGGGAG ATGAGGTGCA TCTTGGCCTG GATCTTGAGA 4320 

TTGGCGATGT TACCGCCCAG ATCCCGCCTG GGGTTCATGT TGTGCAGGAC CACCAGCACG 4380 

6TGTATCCGG TGCACTTGGG GAATTTATCA TGCAACTTGG AAGGGAAGGC GTGAAAGAAT 4440 
TTG6CGACGC


CTTTGTGCCC 


GCCCAGGTTT TCCATGCACT CATCCATGAT 


GATGGCGATG 


4500 


GGCCCGT6GG 


OGGCGGCCTG 


GGCAAAGACG TTTCGGGGGT OGGACACATC 


ATAGTTGTGG 


4560 


TCCT6GGT6A 


GGTCATCATA 


GGCCATTTTA ATGAATTTGG GGCGGAGGGT 


GCCGGACTGG 


4620 


GG6ACIUUVG6 


TACCCTCGAT 


CCCGGGGGGG TAGTTCCCCT CACAGATCTG 


CATCTCCGAG 


4680 


GCTTT6AGCT 


OGGAGGGGGG 


GATCATGTCC ACCTGCGGGG CGATAAAGAA 


CACGGTTTCC 


4740 


GGGGCGGGGG 


AGATGAGCTG 


GGCOGAAAGC AAGTTCCGGA GCAOCTGGGA 


CTTGCOGCAG 


4800 


CCGGTGGGGC 


CGTAGATGAC 


CCCGATGACC GGCTGCAGGT G6TAGTT6AG 


GGAGAGACAG 


4860 


CTGCCX9TCCT 


CCCGGAGGA6 


GGGGGCCACC TCGTTCATCA TCTCXjCGCAC 


GTGCATGTTC 


4920 


TCGCGCACCA 


GTTCCGCCAG 


GAGGCGCTCT CCCCCCAGGG ATAGGAGCTC 


CTGGAGCGAG 


4980 


GCGAAGTTTT 


TCAGCGGCTT 


GAGTCCGTCG GCCATGGGCA TTTT6GAGAG 


GGTTTGTTGC 


5040 


AAGAGTTCCA 


GGCGGTCCCA 


GAGCTCGGTG ATGTGCTCTA CGGCATCTCG 


ATCCAGCAGA 


5100 


CCTCCTCGTT 


TCGCGGGTTG 


GGACGGCTGC GGGAGTAGG6 CACCAGACGA 


TGGGCGTCCA 


5160 


6CGCAGCCAG 


GGTCCGGTCC 


TTCCAGGGTC GCAGCGTCOG CX5TCAGGGTG 


GTCTCCGTCA 


5220 


CGGTGAAGGG 


GTGCGCGCCG 


GGCTGGGCGC TTGCGAGGGT GCGCTTCAGG 


CTCATCCGGC 


5280 


TGGXCGAAAA 


CCGCTCCC6A 


TOGGCGCCCT GOGCGTOGGC CAGGTAGCAA 


TTGACCATGA 


5340 


GTT06TAGTT 


GAGCGCCTOG 


GCCGCGTGGC CTTTGGOGCG GAGCTTACCT 


TTGGAAGTCT 


5400 


GCCCX3CAGGC 


GGGACAGAGG 


AGGGACTTGA GGGCGTAGAG CTTGGGGGCG 


AGGAAGACGG 


5460 


ACT06GGGGC 


GTAGGCGTCC 


60GC0GCAGT GGGC6CAGAC GGTCTCGCAC 


TCCACGAGCC 


5520 


AGGTGAGGTC 


GGGCTGGTCG 


GGGTCAAAAA CCAGTTTCCC GCCGTTCTTT 


TTGATGOGTT 


5580 


TCTTACCTTT 


GGTCTCCATG 


AGCTCGTGTC CCCGCTGGGT GACAAAGAGG 


CTGTCCGTGT 


5640 


CCCCGTAGAC 


CGACTTTATG 


GGCGGGTCCT OGAGCGGTGT GCCGCGGTCC 


TCCTCGTAGA 


5700 


GGAACCCCGC 


CCACTCCGAG 


ACGAAAGCCC GGGTCCAGGC CAGCACGAAG 


GA6GCCACGT 


5760 


GGGAC6GGTA 


6CGGTOGTTG 


TCCACCAGCG GGTCCACCTT TTCCAGGGTA 


TGCAAACACA 


5820 


TGTCCCCCTC 


GTCCACATCC 


AGGAAGGT6A TTGGCTTGTA AGTGTAGGCC 


ACGTGACC6G 


5880 


6GGTCC06GC 


CGGGGGGGTA 


TAAAAGGGTG OGGGTCCCTG CTCGTCCTCA 


CTGTCTTCCG 


5940 


GATCGCTGTC 


CAG6AGCGCC 


AGCTGTTGGG GTAGGTATTC CCTCTCGAAG 


GCGGGCATGA 


6000 


CCTCGGCACT 


CAGGTTGTCA 


GTTTCTAGAA ACGAGGAGGA TTTGATATTG 


ACGGTGCCGG 


6060 


OGGAGATGCC 


TTTCAAGAGC 


CCCTCGTCCA TCTGGTCAGA AAAGACGATC 


TTTTTGTTGT 


6120 


CGAGCTTGGT 


GGCGAA6GAG 


COGTAGAGGG CGTTGGAGAG GAGCTTGGCG 


ATGGAGOGCA 


6180 


TGGTCTGGTT 


TTTTTCCTTG 


TCGGOGCGCT CCTTG6CGGC 6ATGTTGAGC 


TGGACGTACT 


6240 


CGCGCGCCAC 


GCACTTCCAT 


TCGGGGAAGA CGGTGGTCAG CTCGTCX^GGC 


ACGATTCTGA 


6300 


CCTGCCAGCC 


CCGATTATGC 


AGGGTGATGA GGTCCACACT GGTGGCCACC 


TCGCCGCGCA 


6360 
OGGGCTCATT


AGTCCAGCAG 


AGGCGTCCGC 


CCTTGCGCGA 


GCAGAAGGGG 


GGCAGGGGGT 


6420 


CCA6CAT6AC 


CTCGTCGGGG 


GGGTCGGCAT 


CGATGGTGAA 


GATGCCGGGC 


AGGAGGTCGG 


6480 


GGTCAAA6TA 


GCT6ATGGAA 


GTGGCCAGAT 


OGTCCAGGGC 


AGCTTGCCAT 


TCGCGCACGG 


6540 


CCA6C60G06 


CTC6TAGG6A 


CTCAGGG6CG 


TGCCCCAG6G 


CATGGGATGG 


GTAAGOGCGG 


6600 


A66CGTAGAT 


GCCGCA6ATG 


TOGTAGACGT 


AGAGGGGCTC 


CTC3GAGGATG 


CCGAT6TA6G 


6660 


TG6GGTAGCA 


GC6CCCCCCG 


CGGATGGTGG 


CGCGCAOGTA 


GTCATACAGC 


TCGTGCGAG6 


6720 


6GGCGA6GA6 


CCCOGGGCCC 


A6GTTGGTGG 


GACTG6GCTT 


TTCGGCGCGG 


TAGACGATCT 


6780 


6GC66AAAAT 


GGCATGC6A6 


TTGGAGGA6A 


TGGTGGGCCT 


TTGGAAGATG 


TTGAAGT666 


6840 


C6T6G6GCA6 


TCCGACCGAG 


TCGCGGATGA 


AGTGGGCGTA 


GGAGTCTTGC 


AGCTTGGCGA 


6900 


CGAGCTCGGC 


GGTGACTAGG 


ACGTCCAGAG 


CGCAGTAGTC 


GAGGGTCTCC 


TGGATGATGT 


6960 


CATACTTGAG 


CTGTCCCTTT 


TGTTTCCACA 


GCTCGCGGTT 


GAGAAGGAAC 


TCTTCGCGGT 


7020 


CCTTCCAGTA 


CTCTTCGAGG 


GGGAACCCGT 


CCTGATCTGC 


ACGGTAAGAG 


CCTAGCATGT 


7080 


AGAACT6GTT 


GACG6CCTTG 


TAGGOGCAGC 


AGCCCTTCTC 


CACGG66AGG 


GCGTAGGCCT 


7140 


GGGCGGCCTT 


GCGCAGGGAG 


GTGTGOGTGA 


6GGCGAAAGT 


GTCCCTGACC 


ATGACCTTGA 


7200 


GGAACTGGTG 


CTT6AA6T0G 


ATATC6T06C 


AGCCCCCCTG 


GTCGGAGAGC 


TGGAAGTCCG 


7260 


TGOGCTTCTT 


GTAGG0GGG6 


TTGGGCAAAG 


CGAAAGTAAC 


ATCGTT6AAG 


AG6ATCTTGC 


7320 


C06CGC6GGG 


CATAAAGTTG 


0GA6TGATGC 


GGAAAGGTTG 


GGGCACCTCG 


GCCCGGTTGT 


7380 


TGAT6ACCTG 


6GCGGCGAGC 


AOGATCTCGT 


CGAAGCCGTT 


GATGTT6TGG 


CCCACGATGT 


7440 


A6AGTTCCAC 


GAATOGGGGA 


CGGCCCTTGA 


CGTGGGGCAG 


TTTCTTGA6C 


TCCTCGTAGG 


7500 


TGAGCTCGTC 


GGGGTCGCTG 


AGCCCGTGCT 


GCTCGAGCGC 


CCAGTCGGCG 


AGATGGGGGT 


7560 


TGGCGCGGAG 


GAAGGAAGTC 


CAGAGATCCA 


CGGCCAGGGC 


GGTTTGCAGA 


CGGTCCCGGT 


7620 


ACTGACGGAA 


CTGCTGCCCG 


ACGGCCATTT 


TTTCGGGGGT 


GACGCAGTAG 


AAGGTGC6GG 


7680 


6GTCCCCGTG 


CCA60GATCC 


CATTTGAGCT 


G6AGGGCGAG 


ATCGAGGGCG 


AGCT06ACGA 


7740 


6C0GGTG6TC 


CCCGGAGAGT 


TTCATGACCA 


GCATGAAGGG 


GA0GA6CTGC 


TTGCCGAAGG 


7800 


ACCCCATCCA 


GGTGTAGGTT 


TCCACATCGT 


AGGT6AGGAA 


GA6CCTTTCG 


GTG0GA6GAT 


7860 


GCX^AGCOGAT 


GG66AAGAAC 


TGGATCTCCT 


GCCACCAATT 


GGAGGAATGG 


CT6TTGATGT 


7920 


6AT6GAAGTA 


6AAATGC06A 


CGGCGCGCCG 


AACACTCGTG 


CTTGTGTTTA 


TACAAGOGGC 


7980 


CACAGTGCTC 


GCAAOGCTGC 


ACGGGATGCA 


C6TGCTGCAC 


6A6CTGTACC 


TGAGTTCCTT 


8040 


TGACGAGGAA 


TTTCAGTGGG 


AAGTGGAGTC 


6TGGCGCCTG 


CATCTCGTGC 


TGTACTACGT 


8100 


OGTGGTGGTC 


GGCCT6GCCC 


TCTTCTGCCT 


OGATGGTGGT 


CATGCTGACG 


AGCCCGCGCG 


8160 


GGAGGCAGGT 


CCAGACCTCG 


GCGCGAGCGG 


GTCGGAGAGC 


GAGGACGAGG 


GCGCGCAGGC 


8220 


CGGAGCTGTC 


CAGGGTCCTG 


AGACGCTGCG 


GAGTCAGGTC 


AGTGGGCAGC 


GGCGGCGCGC 


8280 
GOTTGACTTG


CAGGAGTTTT 


TCCAGGGCGC 


GCGGGAGGTC 


CAGATGGTAC 


TTGATCTCCA 


8340 


CCGCXSCCATT 


GGTGGCGACG 


TCGATGGCTT 


GCAGGGTCCC 


GTGCCCCTGG 


GGTGTGACCA 


8400 


CCGTCCCCCX; 


TTTCTTCTTG 


GGCGGCTGGG 


gcgacx;gggg 


CGGTGCCTCT 


TCCATGGTTA 


8460 


GAAGCGGC6G 


CGAGGACGOG 


C6CCGGGCGG 


CAGGGGCGGC 


TCGGGGCCCG 


GAGGCAGGGG 


8520 


CG6CAGGG6C 


ACGT0660GC 


06C60GCGGG 


TAGGTTCTGG 


TACTGCGCCC 


GGAGAAGACT 


8580 


GGGGTGAGCG 


ACGACGCGAC GGTTGAOGTC 


CTGGATCT6A 


06CCTCTG6G 


TGAAGGCCAC 


8640 


G6GACCG6TG 


A6TTTGAACC 


TGAAA6AGAG 


TTOGACAGAA 


TCAATCTCGG 


TATCGTTGAC 


8700 


GGCGGCCTGC 


C6CAGGATCT 


CTTGCACGTC 


6CCCGA6TTG 


TCCTGGTAGG 


CGATCTCGGT 


8760 


CATGAACTGC 


TCGATCTCCT 


CCTCTTGAAG 


GTCTCCGCGG 


C06GCG0GCT 


CCAC6GTGGC 


8820 


CGCX^AGGTCG 


TT6GAGATGC 


GGCCCATGAG 


CT6C6AGAAG 


GOGTTCATGC 


CCGCCTCGTT 


8880 


CCAGACGCGG 


CTGTAGACCA 


CGACGCCCTC 


GGGATCGCCG 


GC60GCATGA 


CCACCTGGGC 


8940 


GAGGTTGAGC 


TCCACGTGGC 


GCGTGAAGAC 


CGCGTAGTTG 


CAGAGGCGCT 


GGTAGAGGTA 


9000 


GTTGAGOGTG 


GTGGCGATGT 


GCTCGGTGAC 


GAAGAAATAC 


ATGATCCAGC 


GGCGGAGCGG 


9060 


CATCTCGCTG 


ACGTCGCCCA 


gcgcctcx:aa 


AOGTTCCATG 


GCCTCX3TAAA 


AGTCCACGGC 


9120 


GAA6TTGAAA 


AACTGGGAGT 


TGCGCGCOGA 


GACGGTCAAC 


TCCTCCTCCA 


GAAGACGGAT 


9180 


GAGCTCGGC6 


ATGGT6GCGC 


GCACCTCGCG 


CTCGAAGGCC 


CCCGGGA6TT 


CCTCCACTTC 


9240 


CTCTTCTTCC 


TCCTCCACTA ACATCTCTTC 


TACTTCCTCC 


TCAGGCGGCA 


GTGGTGGOGG 


9300 


G6GAGGGGGC 


CTGC6T0GCC GGC6GCGGAC 


GGGCAGACGG 


TCGATGAAGC 


GCTCX3ATGGT 


9360 


CTGGCCGCX3C 


CGGC6T0GCA 


TGGTCT066T 


6AC6GCGCGC 


CC6TCCTCGC 


GGGGCCGGAG 


9420 


CGTGAAGACG 


CCX^CCGOGCA 


TCTCCAGGTG 


GCOGGGGGGG 


TCCCCGTTGG 


GCAGGGAGAG 


9480 


GGCGCTGACG 


ATGCATCTTA 


TCAATTGCCC 


CGTAGGGACT 


COGOGCAAGG 


ACCTGAGCGT 


9540 


CTCGAGATCC 


ACGGGATCTG 


AAAACCGCTG 


AACGAAGGCT 


TCGAGCCAGT 


CGCAGTCGCA 


9600 


AGGTAGGCTG 


AGCA06GTTT 


CTTCTGGCGG 


GTCATGTTGG 


TTGGGAGCGG 


GGCGGGCGAT 


9660 


GCTGCTG6TG 


ATGAAGTTGA AATAGGCGGT 


TCTGAGACGG 


CGGATGGTGG 


CX^AGGAGCAC 


9720 


CA6GTCTTTG 


GGCCCGGCTT 


GCTGGATGCG 


CAGACGGTCG 


GCCATGCCCC 


AGGCGTGGTC 


9780 


CTGACACCTG 


GCCAGGTCCT 


TGTAGTAGTC 


CTGCATGAGC 


CGCTCCACGG 


GCACCTCCTC 


9840 


CTCGCCCGCG 


CGGCCGTGCA 


TGCGC6TGAG 


CCXGAAGCOG 


CGCTGGGGCT 


GGACGAGOGC 


9900 


CAGGTCGGCG 


ACGACGOGCT C6G0GAGGAT 


GGCTTGCTGG 


ATCTGGGTGA 


GGGTGGTCTG 


9960 


GAAGTCATCA 


AA6T0GACGA AGCGGTG6TA 


GGCTCCGGTG 


TTGATGGTGT 


AGGAGCAGTT 


10020 


GGCCATGACG 


GACCAGTTGA 


0G6TCTGGTG 


GCCCGGA06C 


ACGAGCTCGT 


GGTACTTGAG 


10080 


GCGCGAGTAG 


GCGOGCGTGT 


CGAAGATGTA 


GTCGTTGCAG 


6TGCGCACCA 


GGTACTGGTA 


10140 


GCCGATGAGG 


AAGTGCGGCG 


GCGGCTGGCG 


GTAGAGCGGC 


CATCGCTCGG 


TGGCGGGGGC 


10200 
GCCGGGCGCG AGGTCCTCGA GCATGGTGCG GTGGTAGCCG


TAGATGTACC 


TGGACATCCA 


10260 


GGTGAT6CCG G0GGCGGT6G T6GAGGCGCG OGGGAACTCG 


CGGACGCGGT 


TCCAGATGTT 


10320 


GOGCAGCGGC AGGAAOTAGT TCATGGTGGG CAOGGTCTGG 


CCCGTGAGGC 


GOGCGCAGTC 


10380 


GTGGATGCTC TATACGGOCA AAAA06AAAG C6GTCAG0GG GTCGACTCCG TGGCCTGOAG 


10440 


GCTAAGOGAA C6GGTTGGGC TGGGCGTGTA CCCCGGTT06 AATCTC6AAT CAGGCTGGAG 


10500 


CCGCAGCTAA 06TGGTATTG GCACTCCCGT CTCGACCCAA 


GCCTGCACCA 


ACCCTCCAGG 


10560 


ATACGGA6GC GGGTCGTTTT GCAACTTTTT TTTGGA6GCC 


GGATGAGACT 


AGTAAGCGCG 


10620 


GAAAGCGGCC GACCGCGATG GCTCGCTGCC GTAGTCT6GA 


GAAGAATCGC 


CAGGGTTGCG 


10680 


TTGCGGTGTG CCCCGGTTCG AGGCCGGCCG GATTCCGC5GG 


CTAACGAGGG 


CGTGGCTGCC 


10740 


CCGTCGTTTC CAAGACCCCA TAGCCAGCCX5 ACTTCTCCAG 


TTACGGAGCX; 


AGCCCCTCTT 


10800 


TTGTTTTGTT TGTTTTTGCC AGATGCATCC CGTACTGCGG 


CAGATGCGCC 


CCCACCACCC 


10860 


TCCACCGCAA CAACAGCCCC CTCCACAGCC GGCGCTTCTG 


CCCCCGCCCC 


AGCAGCAACT 


10920 


TCCAGCCACG ACCGCCGCGG CCGCCGTGAG CGGGGCTGGA 


CAGAGTTATG 


ATCACCAGCT 


10980 


GGCCTTGGAA 6AGGG0GA6G GGCTGGOGCG CCTGGGGGCG 


TCGTCGCCGG 


AGCGGCACCC 


11040 


GCGCGTGCAG ATGAAAAGGG ACGCTCGCGA GGCCTACGTG 


CCCAAGCAGA 


ACCTGTTCAG 


11100 


A6ACAGGAGC GGCGAGGAGC CCGA6GAGAT GCGCGCGGCC 


CGGTTCCACG 


CGGGGCGGGA 


11160 


GCTGCGGCGC GGCCTGGACC GAAAGAGGGT GCTGAGGGAC GAGGATTTOG AGGCGGACGA 


11220 


GCTGAGGGGG ATCAGCCCC6 06CGCGCGCA CGTGGCCGOG 


GCCAACCTGG 


TCACGGCGTA 


11280 


C5GAGCAGACC GTGAAGGAGG AGAGCAACTT CCAAAAATCC 


TTCAACAACC 


ACGTGCGCAC 


11340 


CCTGATCGCG CGCGAGGAGG TGACCCTGGG CCTGATGCAC 


CTGTGGGACC 


TGCTGGAGGC 


11400 


CATCGTGCAG AACCCCACCA GCAAGCCGCT GACGGCGCAG 


CTGTTCCTGG 


TGGTGCAGCA 


11460 


TAGTCGGGAC AACGAAGCGT TCAGGGAGGC GCTGCTGAAT 


ATCACCGAGC 


COGAGGGCCG 


11520 


CTGGCTCCTG GACCTGGTGA ACATTCTGCA GAGCATOGTG 


GTGCAGGAGC 


GCGGGCTGCC 


11580 


GCT6TCCGAG AAGCTGGCGG CCATCAACTT CTOGGTGCTG 


AGTTTGGGCA 


AGTACTACGC 


11640 


TAGGAAGATC TACAAGACCC CGTACGTGCC CATAGACAAG 


GAGGTGAAGA 


T06ACGGGTT 


11700 


TTAGAT6CGC ATGACCCTGA AAGTGCTGAC CCTGAGCGAC GATCTGGGGG TGTACCGCAA 


11760 


CGACAGGATG CACCGTGCGG TGAGC6CCAG CAGGCiGGCGC 


GAGCTGAGCG 


ACCAGOAGCT 


11820 


GATGCATAGT CTGCAGCGGG CCCTGACCGG GGCCGGGACC GAGGGGGAGA GCTACTTTGA 


11880 


CATGGGCGCG GACCTGCACT GGCAGCCCAG CCGCCGGGGC 


TTGGAGGCGG 


CGGCAGGACC 


11940 


CTACGTAGAA GAGGTGGACG ATGAGGTGGA CGAGGAGGGC 


GAGTACCTGG 


AAGACTGATG 


12000 


GCGCGACCGT ArTTTTGCTA GATGCAACAA CAACAGCCAC 


CTCCTGATCC 


CGCGATGCGG 


12060 


GC!GGCGCTGC AGAGCGAGCC GTCCGGCATT AACTCCTCGG 


AGGATTGGAC 


CCAGGCCATG 


12120 
CAA06CATCA


TGGCGCTGAC 


GACCCGCAAC 


CCC6AAGCCT 


TTAGACAGCA 


GCCCCAGGCC 


12180 


AACCGGCTCT 


CGGCCATCCT 


GGAGGCCGTG 


GTGCCCTCGC 


GCTCCAACCC 


CACGCACGAG 


12240 


AA6GTCCTG6 


CCATCGTGAA 


cgcxk:tggtg 


GAGAACAAGG 


CCATCC6CGG 


CGACGAGGCC 


12300 


66CCTGGT6T 


ACAACGCGCT 


GCTGGAGCGC 


GTGGCCCGCT 


ACAACAGCAC 


CAACGTGCAG 


12360 


ACCAACCTGG 


ACOGCATGGT 


GACCGACGTG 


OGCGAGGOCX; 


TGGCCCAGCG 


CGAGCGGTTC 


12420 


CA0CGC6A6T 


CCAACCTGGG 


ATCCATGGTG 


606CTGAACG 


CCTTCCTCAG 


CACCCAGCCC 


12480 


GCCAA06T6C 


CCCG6GGCCA 


GGA6GACTAC 


ACCAACT7CA 


TCAGCGCCCT 


6CX3CCTGATG 


12540 


GT6ACC6AG6 


TGCCCCAGAG 


GGAGGTGTAC 


CAGTCG6GGC 


CGGACTACTT 


CTTCCAGACC 


12600 


AGTCGCCAGG 


6CTTGCAGAC 


CGTGAACCTG AGCCAGGCTT 


TCAAGAACTT 


GCAGGGCCTG 


12660 


TGGGGCGTGC 


A6GCCC06GT 


CGGGGACCGC 


GCGACGGTGT 


C6AGCCTGCT 


GACGCOGAAC 


12720 


TCGCGCCTGC 


TGCTGCTGCT 


GGTGGCCCCC 


TTCACGGACA 


GCGGCA6CAT 


CAACCGCAAC 


12780 


TCGTACCTGG 


GCTACCTGAT 


TAACCTGTAC 


CGCGAGGCCA 


TCGGCCAGGC 


GCACGTGGAC 


12840 


GAGCAGACCT 


ACCAGGAGAT 


CACCCACGTG 


AGCCGCGCCC 


TGGGCCAGGA 


C6ACCCGGGC 


12900 


AACCTGGAAG 


CCACCCTGAA 


CTTTTTGCTG 


ACCAACC6GT 


GGCAGAAGAT 


CCCGCCCCAG 


12960 


TAOGCGCTCA 


GCACCGAGGA 


GGAGOGCATC 


CTGCGTTACG 


T6CAGCAGAG 


CGTGGGCCTG 


13020 


TTCCTGATGC 


AGGAGGGGGC 


CACCCCCAGC 


6CC6CGCTCG 


ACATGACCGC 


GCGCAACATG 


13080 


GAGCCCAGCA 


TGTAOGCCAG 


CAACCGCCCXy TTCATCAATA 


AACTGATGGA 


CTACTTGCAT 


13140 


06G6CGGCC6 


CCATGAACTC 


TGACTATTTC 


ACCAA06CCA 


TCCTGAATCC 


CCACTGGCTC 


13200 


C0GCCGC06G 


GGTTCTAGAC 


GGGCGA6TAC GACAT6CC0G 


ACCCCAATGA 


CGGGTTCCTG 


13260 


T6GGACGATG 


TGGACAGCAG 


CGTGTTCTCC 


CCCCGACCGG 


GTGCTAACXSA 


GOGCCCCTTG 


13320 


TG6AAGAAGG 


AAGGCAGCGA 


CCGACGCCCG 


TCCTCGGCGC 


TGTCCGGCCG 


CGAGGGTGCT 


13380 


GCCGCGGOGG 


TGCCCGA6GC 


CGCCAGTCCT 


TTCCCGAGCT 


TGCCCTTCTC 


GCTGAACAGT 


13440 


ATCCGCAGCA 


GOGAGCTGGG 


CAGGATCACG 


CGCCCGCX;CT 


TGCTGGGCX^A 


AGAGGAGTAC 


13500 


TTGAATGACT 


CGCTGTTGAG 


ACCOGAGCGG 


GAGAAGAACT 


TCCCCAATAA 


CGGGATAGAA 


13560 


A6CCTGGTGG 


ACAA6ATGAG 


CCGCTGGAAG 


AOGTATGCGC 


AGGAGCACAG 


GGACGATCCC 


13620 


CGGGC6TCGC 


AGGGGGCCAC 


GAGCCXM3GGC 


AGCGCCGCX:C 


GTAAAOGCCG 


GTGGCACGAC 


13680 


AG6CAG0GGG 


GACAGAT6TG 


GGACGATGA6 


GACTCOGCCXS 


ACGACAGCAG 


CGTGTTGGAC 


13740 


TTG6GTGGGA 


GTGGTAACCC 


GTTC6CTGAC CTGOGCCCCC 


GTATGGGGOG 


CATGATGTAA 


13800 


GAGAAACCGA 


AAATAAATGA 


TACTCACCAA 


GGCCATGGCG 


ACCAGOGTGC 


GTTCGTTTCT 


13860 


TCTCT6TT6T 


TGTTGTATCT 


AGTATGATGA 


GGCGTGOGTA 


CCCGGAGG6T 


CCTCCTCCCT 


13920 


CGTACGAGAG 


CGTGATGCAG 


CAGGCGATGG 


CGGCGGCGGC 


GATGCAGCCC 


COGCTGGAGG 


13980 


CTCCTTACGT 


GCCCCCGCGG 


TACCTGGCGC 


CTACGGAGGG 


GCGGAACAGC 


ATTCGTTACT 


14040 
C60A6CTGGC 


ACX:C?TTGTAC 


GATACCACCC 


GGTTGTACCT 


GGTGGACAAC 


AAGTCGGCGG 


14100 


ACATCGCCTC 


GCTGAACTAC 


CAGAACGACC 


ACAGCAACTT 


CCTGACCACC 


GTGGTGCAGA 


14160 


ACAAT6ACTT 


CACCCCCACG 


GAGGCCAGCA 


CCCAGACCAT 


CAACTTTGAC 


GAG0GCTO6C 


14220 


6GTG66G06G 


CCAGCTGAAA ACCATCATGC 


ACACCAAGAT 


GCCCAACGTG 


AACGAGTTCA 


14280 


TGTACAGCAA 


CAAGTTCAAG 


GCXSOGGGTGA 


TGGTCTCCCG 


CAAGACCCCC 


AATGG6GTGA 


14340 


CAGTGACAGA 


G6ATTATGAT GGTAGTCAGG 


ATGA6CTGAA 


GTATGAATGG 


GTGGAATTTG 


14400 


A6CTGCCCGA 


AGGCAACTTC 


TOGGTGACCA 


TGACCATOGA 


CCTGATGAAC 


AACGCCATCA 


14460 


T06ACAATTA 


CTTGGCGGTG 


GGGCXK^CAGA 


ACGGGGTGCT 


GGAGAGCGAC 


ATCGGOGTGA 


14520 


AGTTCGACAC 


TAGGAACTTC 


AGGCTGGGCT 


GGGACCCCGT 


GACCGAGCTG 


GTCATGCCCG 


14580 


GGGTGTACAC 


CAACGAGGCT 


TTCCATCCCG 


ATATTGTCTT 


GCTGCCCGGC 


TGCGGGGTGG 


14640 


ACTTCACCGA 


GAGCCGCCTC 


AGCAACCTGC 


TGGGCATTCG 


CAAGAGGCAG 


CCCTTCCAGG 


14700 


AAGGCTTCCA 


GATCATGTAC 


GAGGATCTGG 


AGGGGGGCAA 


CATCXrCCGCG 


CTCCTGGATG 


14760 


TCGACGCCTA 


TGAGAAAAGC 


AAG6AGGATG 


CAGCAGCTGA 


AGCAACTGCA 


GCCGTAGCTA 


14820 


CCGCCTCTAC 


CGAGGTCAGG 


GGCGATAATT 


TTGCAAGCGC 


CGCAGCAGTG 


GCAGCGGCCG 


14880 


AGGOGGCTGA 


AACCGAAAGT 


AA6ATAGTCA 


TTCAGCCGGT 


GGAGAAGGAT 


AGCAAGAACA 


14940 


GGA6CTACAA 


C6TACTACCG 


GACAAGATAA 


ACACCGCCTA 


0CGCA6CTGG 


TACCTAGCCT 


15000 


ACAACTATGG 


CGACCCOGAG 


AAGGGCGTGC 


GCTCCTGGAC 


GCTGCTCACC 


ACCTCGGACG 


15060 


TCACCTGCGG 


CGTGGAGCAA GTCTACTGGT 


OGCTGCCCGA 


CATGATGCAA 


6ACCCGGTCA 


15120 


CCTTCOGCTC 


CACGCGTCAA 


GTTAGCAACT 


ACCCGGTGGT 


GGGCGCCGAG 


CTCCTGCCCG 


15180 


TCTACTCCAA 


GAGCTTCTTC 


AACGAGCAGG 


CCGTCTACTC 


GCAGCAGCTG 


CGCGCCTTCA 


15240 


CCTCGCTTAC 


GCACGTCTTC 


AACCGCTTCC 


CCGAGAACCA 


GATCCTCGTC 


CGCCCGCCCG 


15300 


CGCCCACCAT 


TACCACCGTC 


AGTGAAAACG 


TTCCTGCTCT 


CACAGATCAC 


GGGACCCTGC 


15360 


CGCTGCGCAG 


CAGTATCCGG 


GGAGTCCAGC 


GCGTGACCGT 


TACTGACGCC 


A6ACGCCGCA 


15420 


CCTGCCCCTA 


CGTCTACAAG 


GCCCTGGGCA 


TAGTOGCGCC 


GCGCGTCCTC 


TOGAGCCGCA 


15480 


CCTTCTAAAT 


GTCCATTCTC ATCT060CCA 


GTAATAAGAC 


06GTTGGGGC 


CTGCGCGCGC 


15540 


CCAGCAAGAT 


GTACGGAGGC 


GCTOGCCAAC 


GCTCCACGCA 


ACACCCCGTG 


CGCGTGCGCG 


15600 


G6CACTTCCG 


CGCTCCCT6G 


GGOGCCCTCA 


AGGGCCGOGT 


GCGGTOGCGC 


ACCACCGTOG 


15660 


ACGACGTGAT 


CGACCAGGTG 


GTGGCCGACG 


CGCGCAACTA 


CACCCCCGCC 


GCCGCGCCOG 


15720 


TCTCCACCGT 


GGACGCCXSTC 


ATCGACAGCG 


TGGTGGCGGA 


CGCGCGCCGG 


TACGCCCGCG 


15780 


CCAAGAGCCG 


GCGGCGGCGC 


ATCXSCCCGGC 


GGCACCGGAG 


CACCCCCGCC 


ATGCGCGCGG 


15840 


CX5CX5AGCCTT 


GCTGCGCAGG 


GCCAGGCGCA 


CGGGACGCAG 


GGCCATGCTC 


AGGGCGGCCA 


15900 


GACGCGCGGC 


TTCAGGCGCC 


AGCGCCGGCA 


GGACCCGGAG 


ACGCGCGGCC 


ACGGCGGCGG 


15960 
CAGCGGCCAT 


CGCCA6CATG 


TCCCGCCCGC 


GGCGAGGGAA 


CGTGTACTGG 


GTGCGCGACG 


16020 


CC6CCACCGG 


TGTGCGCGTG 


CCCGTGCGCA 


CCCGCCCCCC 


TCGCACTTGA 


AGATGTTCAC 


16080 


TTC6CX»ATGT 


TGATGTOTCC 


CAGCGG06AG 


GAGGATGTCC 


AAGCGCAAAT 


TCAAGGAAGA 


16140 


GATGCTCCAG 


67CATC60GC 


CTGAGATCTA 


OGGCCCTGOG 


GTGGTGAAGG 


AGGAAAGAAA 


16200 


GCCC06CAAA 


ATCAAGCG6G 


TCAAAAAGGA 


GAAAAAGGAA 


GAAGAAAGTG 


ATGTGGACGG 


16260 


ATTGGTGGA6 


TTTGTG06CS6 


A6TTC60CCC 


CCGGOGGCGC 


GT6CAGTGGC 


GCGGGOGGAA 


16320 


6GTGCAAC0G 


GTGCT6A6AC 


CC6GCACCAC 


CGTGGTCTTC AOGCCCGGCG 


A6CGCTC0GG 


16380 


CACC6CTTCC 


AAGC6CTCCT 


AOGAOGAGGT 


GTACGGGGAT 


GATGATATTC 


TGGAGCAGGC 


16440 


GGCOGAGCGC 


CTGGGCGA6T 


TTGCTTACGG 


CAAGCGCAGC 


CGTTCOGCAC 


OGAAGGAAGA 


16500 


GGCGGTGTCC 


ATCCCGCTGG 


ACCAOGGCAA 


CCCCACGCCX; 


AGCCTCAAGC 


CCGTGACCTT 


16560 


GCAGGAGGTG 


CTGCCGACCG 


CGGCGCCGCG 


CCGGGGGTTC 


AAGCGCGAGG 


6CGAGGATCT 


16620 


GTACCCCACC 


ATGCAGCTGA 


TGCTGCCCAA 


GCGCCAGAAG 


CTGGAAGACG 


TGCTGGAGAC 


16680 


CATGAAGGTG 


GACCCGGACG 


TGCAGCCCGA 


GGTCAAGGTG 


CGGCCCATCA 


AGCAGGTGGC 


16740 


CCOGGGCCTG 


GGCGTGCAGA 


COGTGGACAT 


CAAGATTCCC 


ACXX3AGCCCA 


TGGAAAOGCA 


16800 


GACCGA6CCC 


ATGATCAAGC 


CCAGCACCAG 


CACCATGGAG 


GTGCAGACGG 


ATCCCTGGAT 


16860 


GCCATCGGCT 


CCTAGTOGAA 


GACCCC6G0G 


CAAGTAOGGC 


GCGGCCAGCC 


TGCTGATGCC 


16920 


CAACTACGCG 


CTGCATCCTT 


CCATCATCCC 


CA0GC06GGC 


TACCGOGGCA 


CG06CTTCTA 


16980 


CCGCGGTCAT 


ACGAGCAGCC 


GCCGCCGCAA 


6A0CACCACT 


OGCCGCOGCC 


GTCGCC!GGAC 


17040 


CGCCGCTGCA 


ACCACCCCTG 


CCGCCCTGGT GCGGAGAGTG TACCGC06CG 


GCCGCGCACC 


17100 


TCTGACCCTG 


CCGCGCGCGC 


GCTACCACCC 


GAGCATCGCC 


ATTTAAACTT 


TCGCCAGCTT 


17160 


TGCAGATCAA 


TGGCCCTCAC 


ATGCCGCCTT 


CGCGTTCCCA 


TTACGGGCTA 


CCGAGGAAGA 


17220 


AAACOGCGCC 


GTAGAAGGCT 


GGCGGGGAAC 


GGGATGCGTC 


GCCACCACCA 


CCGGCGGCGG 


17280 


CGOGCCATCA 


GCAAGCGGTT 


GGGGGGAGGC 


TTCCTGCCCG 


CGCTGATCCC 


CATCATCGCC 


17340 


GCGGOGAT06 


GGGCGATCCC 


OGGCATTGCT 


TCOGTGGCGG 


TGCAGGCCTC 


TCAGCGCCAC 


17400 


T6AGACACAC 


TTGGAAACAT 


CTTGTAATAA 


ACCCATGGAC 


TCTGACGCTC 


CTGGTCCTGT 


17460 


GATGT6TTTT 


CGTAGACA6A 


TGGAAGACAT 


GAATTTTTCG 


TCCCTGGCTC 


CGCGACAOGG 


17520 


CACGCGGCCG 


TTCAtGGGCA 


CCTG6AGCGA CATC6GCACC AGCCAACTGA 


ACGGGGGCGC 


17580 


CTTGAATTGG 


A6CAGTCTCT 


GGAGCGGGCT 


TAAGAATTTC 


GGGTCCACGC 


TTAAAACCTA 


17640 


T66CAGCAAG 


GCGTGGAACA 


GCACCACAGG 


GCAGGCGCTG 


AGGGATAAGC 


TGAAAGAGCA 


17700 


GAACTTCCAG 


CAGAAGGTGG 


TC6AT66GCT 


CGCCTCGG6C 


ATCAACG6GG 


TGGTGGACCT 


17760 


GGCCAACCAG 


GCCGTGCAGC 


GGCAGATCAA 


CAGCCGCCTG 


GACCCGGTGC 


CGCCCGCCGG 


17820 


CTCCGTGGAG 


ATGCCGCAGG 


TGGAGGAG6A 


GCTGCCTCCC 


CTGGACAAGC 


GGGGCGAGAA 


17880 
GCGACCCC6C 


CCCX3ATGCGG 


AGGAGACGCT 


GCTGACGCAC 


ACGGACOAGC 


CGCCCCCGTA 


17940 


CX3AG6A6GC6 


GTGAAACTGG 


GTCTGCOCAC 


CACGCX3GCCC 


ATCGOGCCCC 


TGGCCACCGG 


18000 


GGTGCT6AAA 


cccx;aaaa6c 


CCGOGACCCT 


GGACTTGCCT 


CCTCCCCAGC 


CTTCCCGCCC 


18060 


CTCTACAGTG 


GCTAAGCCCC 


TGCCGCOGGT 


GGCCGTGGCC 


0GCGCX30GAC 


COGGGGGCAC 


18120 


06CC06CCCT 


CATGOGAACT 


GGCAGAGCAC 


TCTGAACAGC 


ATCGTGGGTC 


TGGGAGTGCA 


18180 


GA6T6T6AAG 


CGCCGC06CT 


GCTATTAAAC 


CTACCGTAGC 


GCTTAACTTG 


CTTGTCTGTG 


18240 


TGTGTATGTA 


TTATGTOGCC 


GCCGCCGCTG 


TCCACCAGAA 


6GAGGAGTGA 


AGAGGCGCGT 


18300 


C6CCGAGTTG 


CAAGATGGCC 


ACCCCATCGA 


TGCTGCCCCA 


GTGGGCGTAC 


ATGCACATCG 


18360 


CCGGACAGGA 


CGCTTCGGAG 


TACCTGAGTC 


CGGGTCTGGT 


GCAGTTTGCC 


CGCGCCACAG 


18420 


ACACCTACTT 


CAGTCTGGGG 


AACAAGTTTA 


GGAACCCCAC 


GGTGGCGCCC 


ACGCACGATG 


18480 


TGACCACCGA 


CCGCAGCCAG 


CGGCTGACGC 


TGCGCTTCGT 


GCCCGTGGAC 


CGCGAGGACA 


18540 


ACACCTACTC 


GTACAAAGTG 


CGCTACACGC 


TGGCCGTGGG 


CX3ACAACCGC 


GTGCTGGACA 


18600 


TGGCGAGCAC 


CTACTTTGAC 


ATCCGCGGCG 


TGCTGGATCG 


GGGCCCTAGC 


TTCAAACCCT 


18660 


ACTCCGGCAC 


CGCCTACAAC 


AGTCTGGCCC 


CCAAGGGAGC 


ACCCAACACT 


TGTCAGTGGA 


18720 


CATATAAAGC 


CGATGGTGAA 


ACTGCCACAG 


AAAAAACCTA 


TACATATGGA 


AATGCACCCG 


18780 


T6CAGGGCAT 


TAAGATCACA 


AAAGATGGTA 


TTCAACTT6G 


AACTGACACC 


GATGATCAGC 


18840 


CAATCTACGC 


AGATAAAACC 


TATCA6CCTG 


AACCTCAAGT 


GGGTGATGCT 


GAATGGCATG 


18900 


ACATCACTGG 


TACTGATGAA 


AAGTATGGAG 


GCAGAGCTCT 


TAAGCCTGAT 


ACCAAAATGA 


18960 


AGCCTT6TTA 


TGGTTCTTTT 


GCCAAGCCTA 


CTAATAAAGA 


AGGAGGTCAG 


GCAAATGTGA 


19020 


AAACAGGAAC 


AGGCACTACT 


AAAGAATATG 


ACATAGACAT 


GGCTTTCTTT 


GACAACAGAA 


19080 


GTGCGGCTGC 


TGCTGGCCTA 


GCTCCAGAAA 


TTGTTTTGTA 


TACTGAAAAT 


GTGGATTTGG 


19140 


AAACTCCAGA 


TACCCATATT 


GTATACAAAG 


CAGGCACAGA 


TGACAGCAGC 


TCTTCTATTA 


19200 


ATTT6GGTCA 


GCAAGCGATG 


CCCAACAGAC 


CTAACTACAT 


TGGTTTCAGA 


GACAACTTTA 


19260 


TCGGGCTCAT 


GTACTACAAC 


AGCACTGGCA 


ATATGGGGGT 


GCTGGCCGGT 


CAGGCTTCTC 


19320 


AGCTGAATGC 


TGTGGTTGAC 


TTGCAAGACA 


GAAACACCGA 


GCTGTCCTAC 


CAGCTCTTGC 


19380 


TTGACTCTCT 


GGGTGACAGA 


ACCCGGTATT 


TCAGTAT6TG 


GAATCAGGCG 


GTGGACAGCT 


19440 


ATGATCCTGA 


•PfifTGCCCATT 


ATTGAAAATC 


ATGGTGT6GA 


GGATGAACTT 


CCCAACTATT 


19500 


GTTTCCCTCT 


GGATGCTGTT 


GGCAGAACAG 


ATACTTATCA 


GGGAATTAAG 


GCTAATGGAA 


19560 


CTGATCAAAC 


CACATGGACC 


AAAGATGACA 


GTGTCAATGA 


TGCTAATGAG 


ATAGGCAAGG 


19620 


GTAATCCATT 


CGCCATGGAA 


ATCAACATCC 


AAGCCAACCT 


GTGGAGGAAC 


TTCCTCTACG 


19680 


CCAACGTGGC 


CCTGTACCTG 


CCCGACTCTT 


ACAAGTACAC 


GCCGGCCAAT 


GTTACCCTGC 


19740 


CCACCAACAC 


CAACACCTAC 


GATTACATGA 


ACGGCCGGGT 


GGTGGCGCCC 


TCGCTGGTGG 


19800 
ACTCCTACAT 


CAACATCGGG 


GCGCGCTGGT 


CGCTGGATCC 


CATGGACAAC 


GTGAACCCCT 


19860 


TCAACCACCA 


CCGCAATGCG 


GGGCTGCGCT 


ACCGCTCCAT 


GCTCCTGGGC 


AACGGGCGCT 


19920 


ACGTGCCCTT 


CCACATCCAG 


GTGCCCCAGA AATTTTT06C 


CATCAAGAGC 


CTCCTGCTCC 


19980 


TGCCCGGGTC 


CTACACCTAC 


GA6TGGAACT 


TCOOCAAGGA 


COTCAACATG 


ATCCTGCAGA 


20040 


GCTCCCTCOG 


CAACGACCT6 


06CA0GGAGG 


GGGCCTCCAT 


CTCCTTCACC 


AGCATCAACC 


20100 


TCTACGCCAC 


CTTCTTCCCC 


ATGG06CACA ACAOGGCCTC 


CACGC7CGAG 


GCCATGCTGC 


20160 


6CAA06ACAC 


CAACGACCAG 


TCCTTCAA06 


ACTACCTCTC 


G6CGG0CAAC 


ATGCTCTACC 


20220 


CGATCCCGGC 


CAACGCCACC 


AAOGTGCCCA 


TCTCCATCCC 


CTCGCGCAAC 


TGGGCCGCCT 


20280 


TCCGOGGCTG 


GTCCTTCACG 


CGTCTCAAGA 


CCAAGGAGAC 


GCCCTCGCTG 


GGCTCCGGGT 


20340 


TCGACCCCTA 


CTTCGTCTAC 


TCGGGCTCCA 


TCCCCTACCT 


OGAOGGCACC 


TTCTACCTCA 


20400 


ACCACACCTT 


CAAGAAGGTC 


TCCATCACCT 


TOGACTCCTC 


CGTCAGCTGG 


CCCGGCAACG 


20460 


ACCGGCTCCT 


GACGCCCAAC 


GAGTTCGAAA 


TCAAGCGCAC 


CGTCGACGGC 


GAGGGCTACA 


20520 


ACGXGGCCCA 


GTGCAACATG 


ACGAAGGACT 


GGTTCCTGGT 


CCAGATGCTG 


GCCCACTACA 


20580 


ACATCGGCTA 


CCAGGGCTTC 


TACGTGCCGX; 


AGG6CTACAA 


GGACCGCATG 


TACTCCTTCT 


20640 


TCCGCAACTT 


CCAGCCCATG 


A6CCGCCAGG 


TGGTGGACGA 


GGTCAACTAC 


AAGGACTACC 


20700 


AG6CCGTCAC 


CCTGGCCTAC 


CAGCACAACA ACTCGGGCTT 


CGTOGGCTAC 


CTCGCGCCCA 


20760 


CCATGCGCCA 


6GGCCAGCCC 


TACCCCGCCA 


ACTACCCCTA 


CC06CTCATC 


GGCAAGAGCG 


20820 


CC6TCACCA6 


CGTCACCCAG 


AAAAAGTTCC 


TCT6CGACAG 


GGTCATGTGG 


CGCATCCCCT 


20880 


TCTCCAGCAA 


CTTCATGTCC 


ATG6GCGCGC 


TCACCGACCT 


OGGCCAGAAC 


ATGCTCTATG 


20940 


CCAACTCCGC 


CCACGCGCTA 


GACAT6AATT 


TCGAAGTCGA 


CCCCATGGAT 


6AGTCCACCC 


21000 


TTCTCTATGT 


TGTCTTCGAA 


GTCTTCGACG 


TCGTCCXJAGT 


GCACCAGCCC 


CACCGCGGCG 


21060 


TCATCGAGGC 


OGTCTACCTG 


CGCACCCCCT 


TCTCGGCCGG 


TAACGCCACC 


ACCTAAGCTC 


21120 


TTGCTTCTTG 


CAAGCCATGG 


CCGOGGGC7C 


OGGCGAGCAG 


GAGCTCAGGG 


CCATCATCCG 


21180 


C0ACCTGG6C 


TG06GGCCCT 


ACTTCCTGGG 


CACCTTCGAT 


AAGCGCTTCC 


CGGGATTCAT 


21240 


GGCXICCGCAC 


AAGCTGGCCT 


GOGCCATCGT 


CAACACGGCC 


GGCCGOGAGA 


CCGGGGGCGA 


21300 


GCACTGGCTG 


GCCTTCGCCT 


GGAACCCGCG 


CTCGAACACC 


TGCTACCTCT 


TOGACCCCTT 


21360 


CGOGTTCTCG 


GACGAGCGCC 


TCAAGCAGAT 


CTACCAGTTC 


GAGTACGAGG 


GCCTGCTGCG 


21420 


COGCAGCGCC 


CTGGCCACCG 


AGGACCXSCTG 


OGTCACCCTG 


GAAAAGTCCA 


CCCAGACCGT 


21480 


6CAGGGTCCG 


CGCTCGGCCG 


CCTGCGGGCT 


CTTCTGCTGC 


AT6TTCCTGC 


ACSGCCTTOGT 


21540 


GCACT6GCCC 


GACCGCCCCA 


TGGACAAGAA 


CCCCACCATG 


AACTTGCTGA 


CGGGGGTGCC 


21600 


CAAOGGCATG 


CTCCAGTCGC 


CCCAGGTGGA 


ACCCACCCTG 


CGCCGCAACC 


AGGAGGCGCT 


21660 


CTACCGCTTC 


CTCAACTCCC 


ACTCCGCCTA 


CTTTOGCTCC 


CACCGCGCGC 


GCATCGAGAA 


21720 
66CCAC0GCC 


TTCGACCGCA 


TGAATCAAGA 


CATGTAAACC 


GTGTGTGTAT 


GTTAAATGTC 


21780 


TTTAATAAAC 


AGCACTTTCA 


TGTTACACAT 


GCATC7GAGA 


TGA7TTATTT 


AGAAATCGAA 


21840 


AGGGTTCT6C 


CGGGTCTOGG 


CATGGCCOGC 


GGGCAGGGAC 


AOGTTGCGGA 


ACTGGTACTT 


21900 


G6CCA6CCAC 


TT6AACTCGG 


GGATCAGCA6 


TTTGGGCAGC 


GGGGTGTOGG 


GGAAGGA6TC 


21960 


G6TCCACA6C 


TTC06CGTCA 


GTTGCAGGGC 


GCCCAGCAGG 


TCGGGCGCGG 


AGATCTTGAA 


22020 


AtCGCAGTTG 


GGACCCGCXST 


TCTGOGOGCSG 


GGAGTTGOGG 


TACAOGGGGT 


TGCAGCACTG 


22080 


GAACACCATC 


AGGGCOGGGT 


GCTTCACXiCT 


CGCCAGCACC 


GTOGCGTCGG 


TGATGCTCTC 


22140 


CACGTC6A6G 


TCCTCGGCGT 


TGGCCATCCC 


GAAGGGGGTC 


ATCTTGCAGG 


TCTGCCTTCC 


22200 


CATGGTGGGC 


ACGCACCCGG 


GCTTGTGGTT 


GCAATCGCAG 


TGCAGGGGGA 


TCAGCATCAT 


22260 


CTGGGCCTGG 


TCGGCGTTCA 


TCCCCX^GGTA 


CATGGCCTTC 


ATGAAAGCCT 


CCAATTGCCT 


22320 


GAACGCCTGC 


TGGGCCTTGG 


CTCCCTCGGT 


GAAGAAGACC 


CCX;CAGGACT 


T6CTAGAGAA 


22380 


CTGGTTGGTG 


GCGCACCCGG 


CGTCGTGCAC 


GCAGCAGCX3C 


GCGTCGTTGT 


TGGCCAGCTG 


22440 


CACCACGCTG 


CGCCCCCAGC 


GGTTCTGGGT 


GATCTTGGCC 


CGGTOGGGGT 


TCTCCTTCAG 


22500 


CG06CGCTGC 


CCGTTCTCGC 


TCGCCACATC 


CATCTCGATC 


ATGTGCTCCT 


TCTGGATCAT 


22560 


GGT6GTCCCG 


TGCAGGCACC 


GCAGCTTGCC 


CTCGGCCTCG 


GTGCACCCGT 


GCAGCCACAG 


22620 


CGOGCACCCG 


GTGCACTCCC 


AGTTCTTGTG 


GGCGATCTGG 


GAATGCGCGT 


GCAOGAAGCC 


22680 


CTGCAGGAAG 


CX3GCCCATCA 


TGGTGGTCAG 


GGTCTTGTTG 


CTAGTGAAGG 


TCAGCGGAAT 


22740 


GCCGCGGTGC 


TCCTCGTTGA 


TGTACAGGTG 


GCAGATGCGG 


CGGTACACCT 


CGCCCTGCTC 


22800 


GGGCATCAGC 


TGGAAGTTGG 


CTTTCA6GTC 


GGTCTCCACG 


CGGTAGCGGT 


CCATCAGCAT 


22860 


AGTCATGATT 


TCCATACCCT 


TCTCCCAGGC 


CGAGACGATG 


GGCAGGCTCA 


TAGGGTTCTT 


22920 


CACCATCATC 


TTAGOGCTAG 


CAGCCGCGGC 


CAG6GGGTCG 


CTCTCGTCCA 


GGGTCTCAAA 


22980 


GCTCOGCTTG 


CCGTCCTTCT 


CGGTGATCCG 


CACCGGGGGG 


TAGCTGAAGC 


CCAOGGCOGC 


23040 


CAGCTCCTCC 


TCGGCCTGTC 


TTTCGTCX!TC 


GCTGTCCTGG 


CTGAOGTCCT 


GCAGGACCAC 


23100 


ATGCTTGGTC 


TTGCGGGGTT 


TCTTCTTGGG 


CGGCAGCGGC 


GGCGGAGATG 


TTGGAGATGG 


23160 


OGAGGGGGAG 


CGCGAGTTCT 


CGCTCACCAC 


TACTATCTCT 


TCCTCTTCTT 


GGTCOGAGGC 


23220 


CACGC6GCGG 


TAGGTATGTC 


TCTTCGGGGG 


CAGAGGCGGA 


GGCGACGGGC 


TCTCGCCGCC 


23280 






CAGAGCCCCT 


TCC60GTTCG 


GGGGTG06CT 


CCCGGOGGCG 


23340 


CTCTGACTGA 


CTTCCTCOGC 


GGCCGGCCAT 


T6TGTTCTCC 


TAGGGAGGAA 


CAACAAGCAT 


23400 


6GAGACTCAG 


CCATCGCCAA 


CCTCGCCATC 


TGCCCCCACC 


GCCGACGAGA 


AGCAGCAGCA 


23460 


GCAGAATGAA 


AGCTTAACCG 


CCCCGCCGCC 


CAGCCCCGCC 


ACCTCCGACX5 


CGGCCGTCCC 


23520 


AGACATGCAA 


GAGATGGAGG 


AATCCATCGA 


GATTCACCTG 


GGCTATGTGA 


CGCCC6GGGA 


23580 


GCACGAGGAG 


GAGCTGGCAG 


TGCGCTTTTC 


ACAAGAAGAG 


ATACACCAAG 


AACAGCCAGA 


23640 
GCA6GAA6CA 


GAGAATGAGC 


AGAGTCAGGC 


TGGGCTCGAG 


CATGACGGCG 


ACTACCTCCA 


23700 


CCTGAGC6GG 


GGGGAGGACG 


CGCTCATCAA 


GCATCTGGCC 


OGGCAGGCCA 


CCATCGTCAA 


23760 


GGATGCGCTG 


CTC6AC0GCA 


CCGAGGTGCC 


CCTCAGGGTG 


GAGGAGCTCA 


GCCGCGCCTA 


23820 


C6A6TTGAAC 


CTCTTCTOGC 


0GCGC3GTGCC 


CCCCAA60GC 


CAGCCCAATG 


GGACCTGOGA 


23880 


GCCCAACCOG 


OGCCTCAACT 


TCTACCCGGT 


CTTCGOGGTG 


CCCGAGGCCC 


TGGCCACCTA 


23940 


CCACATCTTT 


TTCAAGAACC 


AAAAGATCCC 


OGTCTCCTGC 


CGOGCGAACC 


GCACCCGCGC 


24000 


CGACGCCCTT 


TTCAACCTGG 


GTCCCGGOGC 


CCGCCTACCT 


GATATCGCCT 


CCTTGGAAGA 


24060 


GGTTCCCAAG 


ATCTTCGAGG 


GTCTGGGCAG 


OGACGAGACT 


OGGGCCGOGA 


ACGCTCTGCA 


24120 


AGGAGAAGGA 


GGAGAGCATG 


AGCACCACAG 


OGCCCTGGTC 


GAGTTGGAAG 


GCGACAACGC 


24180 


GCGGCTGGC6 


GTGCTCAAAC 


GCACGGTCGA 


GCTGACCCAT 


TTCGCCTACC 


CX5GCTCTGAA 


24240 


CCTGCCCCCC 


AAAGTCATGA 


GCGCX5GTCAT 


GGACCA6GTG 


CTCATCAAGC 


GCGCGTCGCC 


24300 


CATCTCCGAG 


GACGAGGGCA 


TGCAAGACTC 


CX^AGGAGGGC 


AAGCCCGTGG 


TCAGCGACGA 


24360 


GCAGCTGGCC 


CGGTGGCXGG 


GTCCTAATGC 


TAGTCCCCAG 


AGTTTGGAAG 


AGCGGCGCAA 


24420 


ACTCATGATG 


GCCGTGGTCC 


TGGTGACOGT GGAGCTGGAG 


TGCCTGOGCC 


GCTTCTTCGC 


24480 


CGAC606GAG ACCCTG06CA AGGTOGAGGA GAACCT6CAC TACCTCTTCA GGCACG6GTT 


24540 


CGT6C6CCAG 


GCCTGCAAGA 


TCTCCAACGT 


GGAGCTGACC 


AACCTGGTCT 


CCTACATGGG 


24600 


CATCTTGCAC 


GAGAACCGCC 


TGGGGCAGAA 


CGTCCTGCAC 


ACCACCCTGC 


GCGGG6AGGC 


24660 


CCGGCGCGAC 


TACATCC6CG ACTGCGTCTA 


CCTCTACCTC 


TGCCACACCT 


GGCAGACGGG 


24720 


CATGGGCGTG 


TGGCA6CAGT 


GTCTGGAGGA 


GCAGAACCTG 


AAAGAGCTCT 


GCAAGCTCCT 


24780 


GCAGAAGAAC 


CTCAAGGGTC 


TGTGGACCGG 


GTTCGACGAG 


CGCACCACCG 


CCTCGGACCT 


24840 


GGCCGACCTC 


ATTTTCCCCG 


AGCGCCTCA6 


GCTGACGCTG 


CGCAACGGCC 


TGCCCGACTT 


24900 


TATGAGCCAA 


AGCATGTTGC AAAACTTTCG 


CTCTTTCATC 


CTCGAACGCT 


CCGGAATCCT 


24960 


GCCCGCCACC 


TGCTCCGOGC 


TGCCCTCG6A 


CTTOGTGCCG 


CTGACCTTCC 


GCGAGTGCCC 


25020 


CCCGCCGCTG 


TGGAGCCACT 


GCTACCTGCT 


GCXKTCTGGCC 


AACTACCTGG 


CCTACCACTC 


25080 


GGACGTGATC 


GAGGAOGTCA 


60GGCGAGGG 


CCTGCTCGAG 


TGCCACTGCC 


GCTGCAACCT 


25140 


CTGCAOGCCG 


CACCGCTCCC 


TGGCCTGCAA 


CCCCCAGCTG 


CTGAGOGAGA 


CCCAGATCAT 


25200 


C6GCACCTTC 


GAGTTGGAA6 


GGCCCAGOGA 


AGGCGAGGGT 


TCAGCCGCCA 


AGGGGGGTCT 


25260 


GAAACTCACC 


CCG6G6CTG7 


GGACCTOGGC 


CTACTTGCGC 


AAGTTOGTGC 


CCGAGGACTA 


25320 


CCATCCCTTC 


GA6ATCAGGT 


TCTACGAGGA 


CCAATCCCAT 


CCGCCCAAGG 


CCGAGCTGTC 


25380 


GGCCTGC5GTC 


ATCACCCAGG 


GGGCGATCCT 


GGCCCAATTG 


CAAGCCATCC 


AGAAATCCCG 


25440 


CCAAGAATTC 


TTGCTGAAAA 


AGGGCCGCGG 


GGTCTACCTC 


GACCCCCAGA 


CCGGTGAGGA 


25500 


GCTCAACCCC 


GGCTTCCCCC 


AGGATGCCCC 


GAGGAAACAA 


GAAGCTGAAA 


GTGGAGCTGC 


25560 
CGCCCGTGGA 


GGATTTGGAG 


GAAGACTGGG 


AGAACAGCAG 


TCAGGCAGAG 


GAGGAGGAGA 


25620 


TGGAGGAA6A 


CTGGGACAGC 


ACTCAGGCAG 


AGGAGGAGAG 


CCTGCAAGAC 


AGTCTGGAGG 


25680 


AAOACX3A60A 


GGAGGCAGA6 


GAGGAG6TGG 


AAGAA6CAGC 


CGCCGCCAGA 


COGTCGTCCT 


25740 


CX3606GGGGA 


6AAAGCAA6C 


AGCAOGGATA 


CCATCTOOGC 


TCCGGGT06G 


GGTCCCGCTC 


25800 


OACCACACAG 


TAGATGGGAC 


GAGACOGGAC 


GATTCCOGAA 


CCCCACCACC 


CAGACCGGTA 


25860 


A6AAGGAG06 


GCAGGGATAC 


AAGTCCTGGC 


GGGGGCACAA AAACGCCATC 


GTCTCCTGCT 


25920 


T6CAGGCCTG 


CGGGGGCAAC 


ATCTCCTTCA 


CCCGGCGCTA 


CCTGCTCTTC 


CAC0GCG6GG 


25980 


T6AACTTTCC 


COGCAACATC 


TTGCATTACT 


ACCGTCACCT 


CCACAGCCCC 


TACTACTTCC 


26040 


AAGAAGA6GC 


AGCAGCAGCA 


GAAAAAGACC 


AGCAGAAAAC 


CAGCAGCTAG 


AAAATCCACA 


26100 


GCGGCGGCAG 


CAGGTGGACT 


GAGGATCGCG 


GCGAACGAGC 


CGGCGCAAAC 


CCGGGAGCTG 


26160 


AGGAACCGGA 


TCTTTCCCAC 


CCTCTATGCC 


ATCTTCCAGC 


AGAGTCGGGG 


GCAGGAGCAG 


26220 


GAACT6AAAG 


TCAA6AAC06 


TTCTCTGOGC 


TOGCTCACCC 


GCAGTTGTCT 


GTATCACAAG 


26280 


AGOGAAGACC 


AACTTCA606 


CACTCTOGAG 


GAOGCOGAGG 


CTCTCTTCAA 


CAAGTACTGC 


26340 


G06CTCACTC 


TTAAA6AGTA 


GCCCGCaCCC 


6COCAGTOGC 


AGAAAAAGGC 


GGGAATTACG 


26400 


TCACCTGTGC 


CCTTCGCCCT 


AGCCGCCTCC ACCCATCATC 


ATGAGCAAAG 


AGATTCCCAC 


26460 


GCCTTACATG 


TGGAGCTACC 


A6CCCCAGAT GGGCCTGGCC 


GCCGGTGCCG 


CCCAGGACTA 


26520 


CTCCACCCGC 


ATGAATTGGC 


TCAGOGCCGG 


GCCCGCGATG 


ATCTCACGGG 


TGAATGACAT 


26580 


CCG06CCCAC 


CGAAACCAGA 


TACTCCTAGA 


ACA6TCAGCG 


CTCACCGCCA 


CGCCCCGCAA 


26640 


TCACCTCAAT 


CCGCGTAATT 


6GCCCGCCGC 


CCTGGTGTAC 


CAGGAAATTC 


CCCAGCCCAC 


26700 


GACCGTACTA 


CTTCCGCGAG 


AOGCCCAGGC 


CGAAGTCCAG 


CTGACTAACT 


CAGGTGTCCA 


26760 


GCTGGCGGGC 


GGOGCCACCC 


TGTGTCGTCA 


CCXSCCCCGCT 


CAGGGTATAA 


AGCGGCTGGT 


26820 


GATCCGGGGC 


AGA6GCACAC 


AGCTCAAOGA 


OGAGGTGGTG 


AGCTCTTCGC 


TGGGTCTGCG 


26880 


ACCTGACGGA 


GTCTTCCAAC 


TOGCCGGATC 


GGGGAGATCT 


TCCTTCACGC 


CTCGTCAGGC 


26940 


06TCCTGACT 


TTGGAGAGTT 


CGTCCTOGCA 


GCCCOGCTCG 


GGTGGCATCG 


GGACTCTCCA 


27000 


GTTCGTGGA6 


GAGTTCACTC 


CCTOGGTCTA 


CTTCAACCCC 


TTCTCCGGCT 


CCCCCGGCCA 


27060 


CTACCOGGAC 


GAGTTCATCC 


OGAACTTOGA 


CGCCATGAGC GAGTCGGTGG 


ACGGCTACGA 


27120 


TT6AAT6TCC 


CAT6CTCGCG 


CAGCTGACCT 


AGCTCGGCTT 


CGAGACCTGG 




27180 


C06CTTCCGC 


TGCTTCGCTC 


GGGATCTCGC 


CGAGTTTGCC 


TACTTTGAGC 


TGCCCGAGGA 


27240 


GCACCCTCAG 


GGCCCGGCCC 


ACGGAGTGCG 


GATCGTCGTC 


GAAGGGGGCC 


TCGACTCCCA 


27300 


CCTGCTTCGG 


ATCTTCAGCC 


AGCGTCCGAT 


CCTGGTCGAG 


CGCGAGCAAG 


GACAGACCCT 


27360 


TCTGACTCTG 


TACTGCATCT 


GCAACCACCC 


CGGCCTGCAT 


GAAAGTCTTT 


GTTGTCTGCT 


27420 


GTGTACTGAG 


TATAATAAAA 


GCTGAGATCA GOGACTACTC 


CGGACTTCCG 


TGTGTTCCTG 


27480 
AATCCATCAA 


CCAGTCTTTG 


TTCTTCACCG 


GGAACGAGAC 


OGAGCTCCAG 


CTCCAGTGTA 


27540 


AGCCCCACAA 


GAA6TACCTC 


ACCTGGCTGT 


TCCAGGGCTC 


CCCGATCGCC 


GTTGTCAACC 


27600 


ACTGCGACAA 


CGACGGAGTC 


CTGCTGAGCG 


GCCCTGCCAA 


CCTTACTTTT 


TCCACCOGCA 


27660 


6AAGCAA6CT 


CCAGCTCTTC 


GAACCCTTCC 


TCCCCGGGAC 


CTATCAGTOC 


GTCTC6G6AC 


27720 


CCTGCCATCA 


CACCTTCCAC 


CT6ATCCC6A 


ATACCACA6C 


GTCGCTCOCC 


GCTACTAACA 


27780 


ACCAAACTAA 


CCTCCACCAA 


0GCCAC06TC 


GOGACCTTTC 


TGAATCTAAT 


ACTACCACCC 


27840 


ACACCGGAG6 


TGAGCTCCGA 


GGTCAACCAA 


CCTCTGGGAT 


TTACTAOGGC 


CCCTGGGAGG 


27900 


TGGTTG6GTT 


AATAGC6CTA 


GGCCTAGTTG 


CGGGTGGGCT 


TTTGGTTCTC 


TGCTACCTAT 


27960 


ACCTCCCTTG 


CTGTTCGTAC 


TTAGTGGTGC 


TGTGTTGCTG 


GTTTAAGAAA 


TGGG6AAGAT 


28020 


CACCCTA6TG 


AGCTGCGGTG 


OGCTGGTGGC 


GGTGTTGCTT 


TCGATTGTGG 


GACTGGGCGG 


28080 


TGCGGCTGTA 


GTGAAGGAGA 


AGGCCGATCC 


CTGCTTGCAT 


TTCAATCCCA 


ACAAATGCCA 


28140 


GCTGAGTTTT 


CAGCCCGATG 


GCAATCGGTG 


CGCGGTACTG 


ATCAAGTGCG 


GATGGGAATG 


28200 


CGAGAACGTG 


AGAATCGAGT 


ACAATAACAA 


GACTCGGAAC 


AATACTCTCG 


CGTCCGTGTG 


28260 


GCAGCCCGGG 


GACCCCGAGT 


GGTACACC6T 


CTCT6TCCCC 


GGTGCTGACG 


GCTCCCCGCG 


28320 


CACCGT6AAT 


AATACTTTCA 


TTTTT6CGCA 


CATGTGCGAC 


ACGGTCATGT 


GGATGAGCAA 


28380 


GCAGTACGAT 


ATGTGGCCCC 


CCACGAAGGA 


GAACATOGTG 


GTCTTCTCCA 


T06CTTACAG 


28440 


CCTGTGCACG 


GOGCTAATCA 


C06CTATCGT 


GTGCCTGAGC 


ATTCACATGC 


TCATCGCTAT 


28500 


TCGCCCCAGA 


AATAATGCCG 


AAAAAGAAAA 


ACA6CCATAA 


OGTTTTTTTT 


CAGACCTTTT 


28560 


TCAGACCATG 


GCCTCTGTTA 


AATTTTTGCT 


TTTATTTGCC 


AGTCTCATTG 


CCGTCATTCA 


28620 


TGGAATGAGT 


AATGA6AAAA 


TTACTATTTA 


CACTGGCACT 


AATCACACAT 


TGAAAGGTCC 


28680 


AGAAAAAGCC 


ACAGAAGTTT 


CATGGTATTG 


TTATTTTAAT 


GAATCAGATG 


TATCTACTGA 


28740 


ACTCTGTGGA 


AACAATAACA 


AAAAAAATGA 


GAGCATTACT 


CTCATCAAGT 


TTCAATGTGG 


28800 


ATCTGACTTA 


ACCCTAATTA 


ACATCACTAG 


AGACTATGTA 


GGTATGTATT 


ATGGAACTAC 


28860 


AGCAGGCATT 


TC6GACATGG 


AATTTTATCA 


AGTTTCT6TG 


TCTGAACCCA 


CCACGCCTAG 


28920 


AATGACCACA 


ACCACAAAAA 


CTACACCT6T 


TACCACTATG 


CAGCTCACTA 


CCAATAACAT 


28980 


TTTTGCCATG 


C6TCAAATGG 


TCAACAATA6 


CACTCAACCC 


acccx:accca 


GTGAGGAAAT 


29040 


TCCCAAATCC 


ATGATTGGCA 


TTATTGTTGC 


TGTAGTGGTG 


TGCATGTTGA 


TCATOGCCTT 


29100 


6TGCATGGTG 


TACTATGCCT 


TCT6CTACAG 


AAA6CACAGA 


CTGAAOGACA 


AGCTGGAACA 


29160 


CTTACTAAGT 


GTTGAATTTT 


AATTTTTTAG 


AACCATGAAG 


ATCCTAGGCC 


TTTTAATTTT 


29220 


TTCTATCATT 


ACCTCTGCTC 


TATGCAATTC 


TGACAATGAG 


GACGTTACTG 


TCGTTGTCGG 


29280 


ATCAAATTAT 


ACACTGAAAG 


GTCCAGCGAA 


GGGTATGCTT 


TCGTGGTATT 


GCTATTTTGG 


29340 


ATCTGACACT 


ACAGAAACTG 


AATTATGCAA 


TCTTAAGAAT 


GGCAAAATTC 


AAAATTCTAA 


29400 
AATTAACAAT 


TATATATGCA 


ATGGTACTGA 


TCTGATACTC 


CTCAATATCA 


CGAAATCATA 


29460 


TGCTG6CA67 


TACACCTGCC 


CTGGAGATGA 


TGCTGACAGT 


ATGATTTTTT 


ACAAAGTAAC 


29520 


TGTTGTT6AT 


CCCACTACTC 


CACCTCCACC 


CACCACAACT 


ACTCACACCA 


CACACACA6A 


29580 


TCAAACCGCA 


GCA6AGOA6G 


CAGCAAAGTT 


AGCCTTGCAG 


GTCCAAGACA 


6TTCATTT6T 


29640 


T60CATTACC 


CCTACACCTG 


ATCAGC6GTG 


TCCGGGGCTG 


CTAGTCAGCG 


GCATTGTCGG 


29700 


TGTGCTTTC6 


GGATTAGCAG 


TCATAATCAT 


CTGCATGTTC 


ATTTTTGCTT 


6CTGCTATAG 


29760 


AAGGCTTTAC 


CGACAAAAAT 


CAGACCCACT 


GCTGAACCTC 


TATGTTTAAT 


TTTTTCCAGA 


29820 


GTCATGAAGG 


CAGTTAGCGC 


TCTAGTTTTT 


TGTTCTTTGA 


TTGGCATTGT 


TTTTTGCAAT 


29880 


CCTATTCCTA 


AAGTTAGCTT 


TATTAAAGAT 


GTGAATGTTA 


CTGAGGGGGG 


CAATGTGACA 


29940 


CTGGTAGGTG 


TAGAGGGTGC 


TGAAAACACC 


ACCTGGACAA 


AATACCACCT 


CAATGGGTGG 


30000 


AAAGATATTT 


GCAAT7GGAG 


TGTATTAGTT 


TATACATGTG 


AGGGAGTTAA 


TCTTACCATT 


30060 


GTCAATGCCA 


CCTCAGCTCA 


AAATGGTAGA ATTCAAGGAC 


AAAGTGTCAG 


TGTATCTAAT 


30120 


GGGTATTTTA 


CCCAACATAC 


TTTTATCTAT 


GAOGTTAAAG 


TCATACCACT 


GCCTACGCCT 


30180 


AOCCCACCTA 


6CACTACCAC 


ACAGACAACC 


CACACTACAC 


AGACAACCAC 


ATACAGTACA 


30240 


TTAAATCAGC 


CTACCACCAC 


TACA6CAGCA 


GAGGTTGCCA 


GCTCGTCTGG 


GGTCCGAGTG 


30300 


GCATTTTTGA 


TGTGGGCCCC 


ATCTAGCAGT 


CCCACTGCTA 


GTACCAATGA 


GCAGACTACT 


30360 


6AATTTTTGT 


CCACTGTOGA 


GAGCCACACC 


ACAGCTACCT 


CCAGTGCCTT 


CTCTAGCACC 


30420 


GCCAATCTCT 


CCTOGCTTTC 


CTCTACACCA 


ATCAGTCCCG 


CTACTACTCC 


TAGCCCCGCT 


30480 


CCTCTTCCCA 


CTCCCCTGAA 


GCAAACAGAC 


GGCGGCATGC 


AATGGCAGAT 


CACCCTGCTC 


30540 


ATTGTGATCG 


GGTTGGTCAT 


CCTGGCCGTG 


TTGCTCTACT 


ACATCTTCTG 


CCGCCGCATT 


30600 


CCCAACGCGC 


ACCGCAA6CC 


GGTCTACAAG 


CCCATCATTG 


TCGGGCAGCC 


GGAGCCGCTT 


30660 


CAGGTGGAAG 


GGGGTCTAAG 


GAATCTTCTC 


TTCTCTTTTA 


CAGTATGGTG 


ATTGAACTAT 


30720 


GATTCCTAGA 


CAATTCTTGA 


TCACTATTCT 


TATCTGCCTC 


CTCCAA6TCT 


GTGCCACCCT 


30780 


CGCTCTGGTG 


GCCAACGCCA 


GTCCAGACTG 


TATTGGGCCC 


TTCGCCTCCT 


ACGTGCTCTT 


30840 


TGCCTTCACC 


ACCTGCATCT 


GCTGCT6TAG 


CATAGTCTGC 


CTGCTTATCA 


CCTTCTTCCA 


30900 


GTTCATTGAC 


TGGATCTTT6 


TGCGCATCGC 


CTACCTGCGC 


CACCACCCCC 


AGTACCGCGA 


30960 


CCAGCGAGTG 


GOGCGGCTGC 


TCAGGCTCCT 


CTGATAAGCA 


TGOGGGCTCT 


GCTACTTCTC 


31020 


GCGCTTCTGC 


TGTTAGTGCT 


CCCCCGTCCC 


GTCGACCCCC 


GGTCCCCCAC 


CCAGTCCCCC 


31080 


GAGGAGGTCC 


GCAAATGCAA 


ATTCCAAGAA 


CCCTGGAAAT 


TCCTCAAATG 


CTACCGCCAA 


31140 


AAATCAGACA 


TGCATCCCA6 


CTGGATCATG 


ATCATTGGGA 


TCGTGAACAT 


TCTGGCCTGC 


31200 


ACCCTCATCT 


CCTTTGTGAT 


TTACCCCTGC 


TTTGACTTTG 


GTTGGAACTC 


GCCAGAGGCG 


31260 


CTCTATCTCC 


OGCCTGAACC 


TGACACACCA 


CCACAGCAAC 


CTCA6GCACA 


CGCACTACCA 


31320 
CCACTACAGC 


CTAGGCCACA 


ATACATGCCC 


ATATTAGACT 


ATGAGGCCGA 


GCCACAGCGA 


31380 


CCCATGCTCC 


CCGCTATTAG 


TTACTTCAAT 


CTAACCGGCG 


GAGATGACTG 


ACCCACTGGC 


31440 


CAACAACAAC 


GTCAACGACC 


T7CTCCTGGA 


CATGGACGGC 


C6CGCCTCGG 


AGCAGCGACT 


31500 


OGCCCAACTT 


CGGATTOGCC AGGAGCAG6A 


6AGAGC0GTC 


AAGGAGCTGC 


AGGATGC6GT 


31560 


GGCCATCCAC 


CAGTGCAAGA GAGGCATCTT 


CTGCCTGGT6 


AAACAGGCCA AGATCTCCTA 


31620 


CGAG6TCACT 


CCAAACGACC ATCGCCTCTC 


CTACGAGCTC 


CTGCAGCAGC 


GCCAGAA6TT 


31680 


CACCTGCCT6 


GTCGGAGTCA ACCCCATOGT 


CATCACCCAG 


CAGTCTGGCG 


ATACCAAGGG 


31740 


GTGCATCCAC 


T6CTCCTGCG 


ACTCCCCCGA 


CTGCGTCCAC 


ACTCT6ATCA 


AGACCCTCTG 


31800 


CGGCCTCCGC 


GACCTCCTCC 


CCATGAACTA 


ATCACCCCCT 


TATCCAGTGA 


AATAAAGATC 


31860 


ATATTGATGA 


TGATTTTACA 


GAAATAAAAA 


ATAATCATTT 


GATTTGAAAT 


AAAGATAGAA 


31920 


TCATATTGAT 


GATTTGAGTT 


TAACAAAAAA 


ATAAAGAATC 


ACTTACTTGA 


AATCTGATAC 


31980 


CAGGTCTCTG 


TCCATGTTTT 


CTGCCAACAC 


CACTTCACTC 


CCCTCTTCCC 


AGCTCTGGTA 


32040 


CTGCA6GCCC 


CGGCGGGCTG 


CAAACTTCCT 


CCACAOGCTG 


AAGGG6ATGT 


CAAATTCCTC 


32100 


CTGTCCCTCA 


ATCTTCATTT 


TATCTTCTAT 


CAGATGTCCA 


AAAAGCGCGT 


C0GGGTG6AT 


32160 


GATGACTTCG 


ACCCCGTCTA 


CCCCTACGAT 


GCAGACAACG 


CACCGACCGT 


GCCCTTCATC 


32220 


AACCCCCCCT 


TCGTCTCTTC 


A6ATGGATTC 


CAAGA6AAGC 


CCCTGGGGGT 


GTTGTCCCTG 


32280 


CGACTGGCC!G 


ACCCCGTCAC 


CACCAAGAAC 


GGGGAAATCA 


CCCTCAA6CT 


GGGAGAG6GG 


32340 


GTGGACCTCG 


ATTCCTCGGG 


AAAACTCATC 


TCCAACACGG 


CCACCAAGGC 


CGCCGCCCCT 


32400 


CTCAGTTTTT 


CCAACAACAC 


CATTTCCCTT 


AACATGGATC 


ACCCCTTTTA 


CACTAAAGAT 


32460 


GGAAAATTAT 


CCTTACAAGT 


TTCTCCACCA 


TTAAATATAC 


TGAGAACAAG 


CATTCTAAAC 


32520 


ACACTAGCTT 


TAGGTTTTGG 


ATCAGGTTTA 


GGACTCCGTG 


GCTCTGCCTT 


GGCAGTACAG 


32580 


TTAGTCTCTC 


CACTTACATT 


TGATACTGAT 


GGAAACATAA 


AGCTTACCTT 


AGACAGAGGT 


32640 


TTGCATGTTA 


CAACAGGAGA 


TGCAATTGAA 


AGCAACATAA 


GCT6GGCTAA 


AGGTTTAAAA 


32700 


TTTGAAGATG 


GAGCCATAGC 


AACCAACATT 


GGAAAT6GGT 


TAGAGTTTGG 


AAGCAGTAGT 


32760 


ACAGAAACAG 


GTGTTGATGA 


TGCTTACCCA 


ATCCAAGTTA 


AACTTGGATC 


TGGCCTTA6C 


32820 


TTTGACAGTA 


CAGGAGCCAT 


AATGGCTGGT 


AACAAAGAAG 


ACGATAAACT 


CACTTTGTGG 


32880 


ACAACACCTG 


ATCCATCACC 


AAACTGTCAA 


ATACTOGCAG 


AAAATGATGC 


AAAACTAACA 


32940 


CTTTGCTTGA 


CTAAAT6TGG 


TAGTCAAATA 


CTGGCCACTG 


TGTCAGTCTT 


AGTTGTAGGA 


33000 


AGTGGAAACC 


TAAACCCCAT 


TACTGGCACC 


GTAAGCAGTG 


CTCAGGTGTT 


TCTACGTTTT 


33060 


GATGCAAACG 


GTGTTCTTTT 


AACAGAACAT 


TCTACACTAA 


AAAAATACTG 


GGGGTATAGG 


33120 


CAGGGAGATA 


GGATAGATGG 


CACTCCATAT 


ACCAATGCTG 


TAGGATTCAT 


GCCCAATTTA 


33180 


AAAGCTTATC 


CAAAGTCACA 


AAGTTCTACT 


ACTAAAAATA 


ATATAGTAGG 


GCAAGTATAC 


33240 
ATGAATGGA6 


ATGTTTCAAA 


ACCTATGCTT 


CTCACTATAA 


CCCTCAATGG 


TACTGATGAC 


33300 


AGCAACA6TA 


CATATTCAAT 


GTCATTTTCA 


TACACCTGGA 


CTAATGGAAG 


CTATGTTGGA 


33360 


GCAACATTTG 


GGGCTAACTC 


TTATACCTTC 


TCATACATCG 


CCCAAGAATG 


AACACTGTAT 


33420 


CCCACCCT6C 


ATOCCAACCC 


TTCCCACCCC 


ACTCTGTGGA 


ACAAACTCTG 


AAACACAAAA 


33480 


TAAAATAAAG 


TTCAAGTOTT 


TTATTGATTC 


AACAGTTTTA 


CAGGATTCGA 


GCAGTTATTT 


33540 


TTCCTCCACC 


CTCCCAGGAC 


ATGGAATACA 


CCACCCTCTC 


CCCCCGCACA 


GCCTTGAACA 


33600 


TCT6AATGCC 


ATTGGTGATG 


GACATGCTTT 


TGGTCTCCAC 


GTTCCACACA 


GTTTCAGAGC 


33660 


GAGCCAGTCT 


CGGGT06GTC 


AGGGAGATGA


AACCCTCCGG 


GCACTCCCGC 


ATCTGCACCT 


33720 


CACAGCTCAA 


CAGCTGAGGA 


TTGTCCTCGG 


TGGTCGGGAT 


CACGGTTATC 


TGGAAGAAGC 


33780 


agaagagcxk; 


CGGTGGGAAT 


CATAGTCCGC 


GAACGGGATC 


GGCCGGTGGT 


GTCGCATCAG 


33840 


GCCCCGCAGC 


AGTCGCTGCC 


GCCGCXX5CTC 


CGTCAAGCTG 


CTGC?TCAGGG 


GGTCCGGGTC 


33900 


CAGGGACTCC 


CTCAGCATGA 


TGCCCACGGC 


CCTCAGCATC 


AGTCX^TCTGG 


TGCGGCGGGC 


33960 


GCAGCAGCGC 


ATGCGGATCT


CGCTCAGGTC 


GCTGCAGTAC 


GTGCAACACA 


GAACCACCAG 


34020 


GTTGTTCAAC


AGTCCATAGT 


TCAAGAOGCT 


CCAGCCGAAA 


CTCATCGOGG 


GAAGGATGCT 


34080 


ACCCACGTGG 


CCGTCGTACC


AGATCCTCAG 


GTAAATCAAG 


TGG76CCCCC 


TCCAGAACAC 


34140 


GCTGCCCACG 


TACATGATCT


CCTTOGGGAT 


GTGGCGGTTC 


ACCACCTCCC 


GGTACCACAT 


34200 


CACCCTCTGG 


TTGAACATGC 


AGCCCCGGAT


GATCCTGCGG 


AACCACAGGG 


CCAGCACCGC 


34260 


CCCGCCCGCC 


ATGCAGCGAA 


GAGACCCOGG 


GTCCCGGCAA 


TGGCAATGGA 


GGACCCACCG 


34320 


CTCGTACCCG 


TGGATCATCT 


GGGAGCTGAA 


CAAGTCTATG 


TTGGCACAGC 


ACAGGCATAT 


34380 


GCTCATGCAT 


CTCTTCAGCA 


CTCTCAACTC 


CTCGGGGGTC 


AAAACCATAT 


CCCAGGGCAC 


34440 


GGGGAACTCT 


TGCAGGACAG 


CGAACCCCGC 


AGAACAGGGC 


AATCCTCGCA 


CAGAACTTAC 


34500 


ATTGTGCATG 


GACAGGGTAT 


CGCAATCAGG 


CAGCACCGGG 


TGATCCTCCA 


CCAGAGAAGC 


34560 


GCGGGTCTCG 


GTCTCCTCAC AG0GTG6TAA 


GGGGGCCGGC 


CGATAOGGGT 


GATGGOGGGA 


34620 


CGCGGCTGAT 


CGTGTTCGOG 


ACCGTGTCAT 


GATGCAGTTG 


CTTTCGGACA 


TTTTCGTACT


34680 


TGCTGTAGCA 


GAACCTGGTC 


OGGGCGCTGC 


ACACOGATOG 


CCGGCGGOGG 


TCTCGGCGCT 


34740 


TGGAACGCTC 


GGTGTTGAAA TTGTAAAACA 


GCCACTCTCT 


CAGACCGTGC


AGCAGATCTA 


34800 


GGGCCTCAGG 


AGTGATGAAG 


ATCCCATCAT 


GCCTGATGGC 


TCTGATCACA 


TOGACCACCG 


34860 


TGGAATGGGC 


CAGACCCAGC 


CAGATGATGC 


AATTTTGTTG 


GGTTTCGGTG 


ACGGCGGGGG 


34920 


AGGGAAGAAC 


AGGAAGAACC 


ATGATTAACT 


TTTAATCCAA 


ACGGTCTCGG 


AGTACTTCAA 


34980 


AATGAAGATC 


GCGGAGATGG 


CACCTCTCGC 


CCCCGCTGTG 


TTGGTGGAAA 


ATAACAGCCA 


35040 


GGTCAAAGGT


GATACGGTTC 


TCGAGATGTT 


CCACGGTGGC 


TTCCAGCAAA 


GCCTCCACGC 


35100 


GCACATCCAG 


AAACAAGACA 


ATAGCGAAAG


CGGGAGGGTT 


CTCTAATTCC 


TCAATCATCA 


35160 
TGTTACACTC


CTGCACCATC 


CCCAGATAAT 


TTTCATTTTT 


CCAGCCTTGA 


ATGATTCGAA 


35220 


CTAGTTCGTG 


AGGTAAATCC 


AAGCCAGCCA 


TGATAAAGAG 


CTCGCGCAGA 


GCGCCCTCCA 


35280 


CCGGCATTCT 


TAAGCACACC 


CTCATAATTC 


CAAGATATTC 


TGCTCCTGGT 


TCACCTGCAG 


35340 


CAGATTGACA 


AGCGGAATAT 


CAAAATCTCT 


GCCGCGATCC 


CTGAGCTCCT 


CCCTCAGCAA 


35400 


TAACTGTAAG 


TACTCTTTCA 


TATCCTCTCC 


GAAATTTTTA 


GCCATAGGAC 


CACCAGGAAT 


35460 


AAGATTAGGG 


GAAGCCACAG


TACAGATAAA 


CCGAAGTCCT 


CCCCAGTGAG 


CATTGCCAAA 


35520 


TGCAAGACTG 


CTATAAGCAT 


GCTGGCTAGA 


CCCGGTGATA 


TCTTCCAGAT 


AACTGGACAG 


35580 


AAAATCGCCC 


AGGCAATTTT 


TAAGAAAATC 


AACAAAAGAA AAATCCTCCA 


GGTGGAOGTT 


35640 


TAGAGCCTCG 


GGAACAACGA 


TGAAGTAAAT 


GCAAGCGGTG 


CGTTCCAGCA 


TGGTTAGTTA 


35700 


GCTGATCTGT 


AGAAAAAACA 


AAAATGAACA 


TTAAACCATG 


CTAGCCTGGC 


GAACAGGTGG 


35760 


GTAAATCGTT 


CTCTCCAGCA 


CCAGGCAGGC 


CACGGGGTCT 


CCGGCGCGAC 


CCTCGTAAAA 


35820 


ATTGTCGCTA 


TGATTGAAAA 


CCATCACAGA 


GAGACGTTCC 


GGGTGGCCGG 


CGTGAATGAT 


35880 


TCGACAAGAT 


GAATACACCC 


COGGAACATT 


GGCGTCCGCG 


AGTGAAAAAA 


AGCGCCCGAG 


35940 


GAAGCSAATAA 


GGCACTACAA 


TGCTCAGTCT 


CAAGTCCAGC 


AAAGCGATGC 


CATGCGGATG 


36000 


AAGCACAAAA


TTCTCAGGTG


OGTACAAAAT 


GTAATTACTC 


CCCTCCTGCA 


CAGGCAGCAA 


36060 


AGCCCCCGAT


CCCTCCAGGT 


ACACATACAA 


AGCCTCAGCG 


TCCATAGCTT 


ACCGAGCAGC


36120 


AGCACACAAC


AGGCGCAAGA 


GTCAGAGAAA 


GGCTGAGCTC 


TAACCTGTCC 


ACCCGCTCTC 


36180 


TGCTCAATAT 


ATAGCCCAGA 


TCTACACTGA 


CGTAAAGGCC 


AAAGTCTAAA 


AATACCCGCC 


36240 


AAATAATCAC 


ACAOGCCCAG 


CACACGCCCA 


GAAACCGGTG 


ACACACTCAA 


AAAAATACGC 


36300 


GCACTTCCTC 


AAACGCCCAA 


AACTGCOGTC 


ATTTCCGGGT 


TCCCACGCTA 


CGTCATCAAA 


36360 


ACACGACTTT 


CAAATTCCGT 


CGACCGTTAA 


AAACGTCACC 


CGCCCCGCCC 


CTAACGGTCG 


36420 


CCCGTCTCTC 


AGCCAATCAG 


CGCCCCGCAT 


CCCCAAATTC 


AAACACCTCA 


TTTGCATATT 


36480 


AACGCGCACA


AAAAGTTTGA 


GGTATATTAT 


TGATGATGG 






36519 



(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 8299 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:
GCCCAATACG CAAACCGCCT CTCCCCGCGC GTTGGCCGAT TCATTAATGC AGCTGCGCGC 60
TCGCTCGCTC ACTGAGGCCG CCCGGGCAAA GCCCGGGCGT CGGGGGACCT TTGGTCGCCC 120
GGCCTCAGTG


AGCGAGCGAG 


CGCGCAGAGA 


GGGAGTGGCC 


AACTCCATCA 


CTAGGGGTTC 


180 


CTTGTAGTTA 


ATGATTAACC 


CGCCATGCTA CTTATCTACA 


TCATCGATGA 


ATTCGAGCTT 


240 


GCATGCCTGC 


AGGTCGTTAC


ATAACTTAOG 


GTAAATGGCC 


OGCCTCGCTG 


ACCGCCCAAC 


300 


GACCCCCGCC 


CATTGACGTC 


AATAATGACG 


TATGTTCOCA 


TAGTAAOGCC 


AATAGGGACT 


360 


TTCGATTGAC 


GTCAATGGGT 


GGAGTATTTA 


OGGTAAACTC 


CCCACTTGGC 


AGTACATCAA 


420 


GTGTATCATA 


TGCCAAGTAC 


GCCCCCTATT GACGTCAATG 


A06GTAAATG 


GCCCGCCTGG 


480 


CATTATGCCC 


AGTACATGAC 


CTTATGGGAC 


TTTCCTACTT 


GGCAGTACAT


CTACGTATTA 


540 


GTCATCGCTA 


TTACCATGGT 


GATGCGGTTT


TGGCAGTACA 


TCAATGGGCG 


TGGATAGCGG 


600 


TTTGACTCAC 


GGGGATTTCC 


AAGTCTCCAC 


CCCATTGACG 


TCAATGGGAG 


TTTGTTTTGG 


660 


CACCAAAATC 


AACGGGACTT 


TCCAAAATGT 


CGTAACAACT 


CCGCCCCATT 


GACGCAAATG 


720 


GGCGGTAGGC 


GTGTAGGGTG 


GGAGGTCTAT 


ATAAGCAGAG 


CTCGTTTAGT 


GAACCGTCAG 


780 


ATCX3CCT6GA 


GACGCCATCC 


ACGCTGTTTT 


GACCTCCATA 


GAAGACACCG 


GGACCGATCC 


840 


AGCCTCCGGA 


CTCTAGAGGA 


TCCGGTACTC 


GACCCGAGCT


CGGATCCACT 


AGTAACGGCC 


900 


GCCAGTGTGC 


TGGAATTCTG 


CACTCCAGGC 


TGCCCGGGTT 


TGCATGCTGC 


TGCTGCTGCT 


960 


GCTGCTGGGC 


CTGAGGCTAC 


AGCTCTCCCT


GGGCATCATC 


CTAGTTGAGG


AGGAGAACCC 


1020 


GGACTTCTGG


AACCGCGAGG 


CAGCOGAGGC 


CCTGGGTGCC


GCCAAGAAGC 


TGCAGCCTGC 


1080 


ACAGACAGCC 


GCCAAGAACC 


TGATCATCTT 


CCTGGGCGAT 


GGGATGGGGG


TGTCTACGGT 


1140 


GACAGCTGCC 


AGGATCCTAA 


AAGGGCAGAA 


GAAGGACAAA 


CTGGGGCCTG 


AGATACCCCT 


1200 


GGCCATGGAC


CGCTTCCCAT 


ATGTGGCTCT 


GTCCAAGACA 


TACAATGTAG 


ACAAACATGT 


1260 


GCCAGACAGT


GGAGCCACAG 


CCACGGCCTA 


CCTGTGCGGG


GTCAAGGGCA 


ACTTCCAGAC 


1320 


CATTGGCTTG 


AGTGCAGCCG 


CCCGCTTTAA 


CCAGTGCAAC 


ACGACACGCX3 


GCAACGAGGT 


1380 


CATCTCCGTG 


ATGAATGGGG


CCAAGAAAGC 


AGGGAAGTCA 


GTGGGAGTGG 


TAACCACCAC 


1440 


A06AGTGCAG 


CACGCCTCX;C 


CAGCCGGCAC CTAOGCCCAC 


ACGGTGAACC 


GCAACTGGTA 


1500 


CTCGGACGCC 


GACGTGCCTG 


CCTCGGCCCG 


CCAGGAGGGG 


TGCCAGGACA 


TCGCTA06CA 


1560 


GCTCATCTCC 


AACATGGACA 


TTGATGTGAT 


CCTAGGTGGA


GGCCGAAAGT 


ACATGTTTCG 


1620 


CATGGGAACC 


CCAGACCCTG 


AGTACCCAGA TGACTACAGC 


CAAGGTGGGA 


CCAGGCTGGA 


1680 


CSGGGAAGAAT 


CTGGTGCAGG 


AATGGCTOGG 


CGAACGCCAG 


GGTGCCCGGT 


ACGTGTGGAA 


1740 


CCGCACTGAG


CTCATGCAGG 


CTTCCCTGGA 


CCCGTCTGTG 


ACCCATCTCA 


TGGGTCTCTT 


1800 


TGAGCCTGGA 


GACATGAAAT 


ACGAGATCCA 


CCGAGACTCC 


ACACTGGACC 


CCTCCCTGAT 


1860 


GGAGATGACA 


GAGGCTGCCC 


TGCGCCTGCT 


GAGCAGACAC 


CCCCGCGGCT 


TCTTCCTCTT 


1920 


CGTGGAGGGT 


GGTCGCATOG 


ACCATGGTCA 


TCATGAAAGC 


AGGGCTTACC 


GGGCACTGAC 


1980 


TGAGACGATC


ATGTTCGACG 


ACGCCATTGA


GAGGGCGGGC 


CAGCTCACCA 


GCGAGGAGGA 


2040 
CACGCTGAGC


CTCGTCACTG 


CCGACCACTC CCACGTCTTC 


TCCTTCGGAG 


GCTACCCCCT 


2100 


GCGAGGGAGC


TCCTTCATCG 


GGCTGGCCGC TGGCAAGGCC 


CGGGACAGGA 


AGGCCTACAC 


2160 


GGTCCTCCTA


TACGGAAAOG 


GTCCAGGCTA TGTGCTCAAG 


GACGGCGCCC 


GGCCGGATGT 


2220 


TAC06AGAGC 


GAGAGCGGGA 


GCCCCGAGTA TOGGCAGCAG 


TCAGCAGTGC 


CCCTGGA06A 


2280 


AGAGACCCAC


GCAGGCGAGG


AOGTGGCGGT GTTOGCGOGC 


GGCCCGCAGG 


CGCACCTGGT 


2340 


TCA06GCGTG 


CAGGAGCAGA 


CCTTCATAGC GCACGTCATG 


GCCTTCX3CCG 


CCTGCCTGGA 


2400 


GCCCTACACC 


GCCTGCGACC 


TGGCGCCCCC CGCCGGCACC 


ACCGACGCCG 


CGCACCCGGG 


2460 


GCGGTCCGTG


GTCCCCGCGT 


TGCTTCCTCT GCTGGCOGGG 


ACCCTGCTGC 


TGCTGGAGAC 


2520 


GGCCACTGCT 


CCCTGAGTGT 


CCCX3TCCCTG GGGCTCCTGC 


TTCOCCATCC 


CGGAGTTCTC 


2580 


CTGCTCCCCA 


CCTCCTGTCG 


TCCTGCCTGG CCTCCAGCCC 


GAGTCGTCAT 


CCCCGGAGTC 


2640 


CCTATACAGA 


GGTCCTGCCA 


TGGAACCTTC CCCTCCCCGT 


GCGCTCTGGG 


GACTGAGCCC 


2700 


ATGACACCAA 


ACCTGCCCCT 


TGGCTGCTCT CGGACTCCCT 


ACCCCAACCC 


CAGGGACTGC 


2760 


AGGTTGTGCC 


CTGTGGCTGC 


CTGCACCCCA GGAAAGGAGG 


GGGCTCAGGC 


CATCCAGCCA 


2820 


CCACCTACAG 


CCCAGTGGGG 


TCGAGACAGA TGGTCAGTCT 


GGAGGATGAC 


GTGGCGTGAA 


2880 


GCTGGCCGCG 


GGGATCCAGA 


CATGATAAGA TACATTGATG 


AGTTTGGACA 


AACCACAACT 


2940 


AGAATGCAGT 


GAAAAAAATG 


CTTTATTTGT GAAATTTGTG 


ATGCTATTGC 


TTTATTTGTA 


3000 


ACCA7TATAA 


GCTGGAATAA 


ACAAGTTAAC AACAACAATT


GCATTCATTT 


TATGTTTCAG 


3060 


GTTCAGGGGG 


AGGTGTGGGA 


GGTTTTTTCG GATCCTCTAG 


AGTOGACTCT 


AGANNNNNNN 


3120 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


3180 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


3240 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


3300 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


3360 


NNNNNNNNNN


NNNNNNNNNN 


NNNNNNNNNN NNNGGATCCC 


CATGACTACG 


TCCGGCGTTC 


3420 


GATTTGGCAT 


GACACTACGA 


CCAACACGAT CTOGGTTGTC 


TCGGOGCACT 


CCGTACAGTA


3480 


GGGATOGTCT 


ACCTCCTTTT 


GAGACAGAAA CCCGCGCTAC


CATACTGGAG 


GATCATCCGC 


3540 


TGCTGCCOGA 


ATGTAACACT 


TTGACAATGC ACAAOGTGAG 


TTACTGTGCGA 


GGTCTTCCCT 


3600 


GCAGTGTGGG


ATTTACGCTG 


ATTCAGGAAT GGGTTGTTCC 


CTGGGATATG 


GTTCTAACGC 


3660 


GGGAGGAGCT 


TGTAATCCTG 


AGGAAGTGTA TGCAOGTGTG 


CCTGTGTTGT 


GCCAACATTG 


3720 


ATATCATGAC 


GAGCATGATG


ATCCATGGTT ACGAGTCCTG 


GGCTCTCCAC 


TGTCATTGTT 


3780 


CCAGTCCCGG 


TTCCCTGCAG 


TGTATAGCCG GCGGGCAGGT 


TTTGGCCAGC 


TGGTTTAGGA 


3840 


TGGTGGTGGA 


TGGCGCCATG 


TTTAATCAGA GGTTTATATG 


GTACCGGGAG 


GTGGTGAATT 


3900 


ACAACATGCC 


AAAAGAGGTA 


ATGTTTATGT CCAGCGTGTT 


TATGAGGGGT 


CX3CCACTTAA 


3960 
TCTACCTGCG


CTTGTGGTAT 


GATGGCCACG 


TGGGTTCTGT 


GGTCCCCGCC 


ATGAGCTTTG 


4020 


GATACAGCGC 


CTTGCACTGT 


GGGATTTTGA 


ACAATATTGT 


GGTGCTGTGC 


TGCAGTTACT 


4080 


GTGCTGATTT 


AAGTGAGATC 


AGGGT6CGCX 


GCTGTGCCCG 


GAGGACAAGG 


CGCCTTATGC


4140 


TGOGGGCGGT 


GCGAATCATC 


GCTGAGGAGA


CCACTGCCAT 


GTTCTATTCC 


T6CAGGA06G 


4200 


AGOGGCGGOG 


GCAGCAGTTT


KTTCGOQCGC 


TGCTGCAGCA 


CGAOCGCCCT 


ATCCTGATGC 


4260 


AOGATTATGA 


CTCTACCCCC 


ATGTAGGGAT 


CCCCATCACT 


AGTGOGGCCG 


CGGGGATCCA 


4320 


GACATGATAA 


GATACATTGA


TGAGTTTGGA 


CAAACCACAA 


CTAGAATGCA 


GTGAAAAAAA 


4380 


TGCTTTATTT


GTGAAATTTG


TGATGCTATT 


GCTTTATTTG 


TAACCATTAT 


AAGCTGCAAT 


4440 


AAACAAGTTA 


ACAACAACAA 


TTGCATTCAT 


TTTATGTTTC 


AGGTTCAGGG 


GGAGGTGTGG 


4500 


GAGGTTTTTT 


OGGATCCTCT 


AGAGTCGACC 


TGCAGGCATG 


CAAGCTGTAG 


ATAAGTAGCA 


4560 


TGGCGGGTTA 


ATCATTAACT 


ACAAGGAACC 


CCTAGTGATG 


GAGTTGGCCA 


CTCCCTCTCT 


4620 


G06CGCTCGC 


TCGCTCACTG 


AGGCCGGGCG 


ACCAAAGGTC 


GCCCGACGCC 


CGGGCTTTGC 


4680 


COGGGCGGCC 


TCAGTGAG06 


AGCGAGCGCG 


CAGCTGGCGT 


AATAGCGAAG 


AGGCCCGCAC 


4740 


CGATCGCCCT 


TCCGAACAGT


TGCX;CAGCCT 


GAATGGCGAA 


TGGAANTTCC 


AGACGATTGA 


4800 


G06TCAAAAT 


GTAGGTATTT 


CCATGAGCGT 


TTTTCCTGTT 


GCAATGGCTG 


GCGGTAATAT 


4860 


TGTTCTGGAT 


ATTACCAGCA


AGGCCGATAG


TTTGAGTTCT 


TCTACTCAGG 


CAAGTGATGT 


4920 


TATTACTAAT 


CAAAGAAGTA 


TTGCGACAAC


GGTTAATTTG 


CGTGATGGAC 


AGACTCTTTT 


4980 


ACTCGGTGGC 


CTCACTGATT 


ATAAAAACAC 


TTCTCAGGAT 


TCTGGCGTAC 


OGTTCCTGTC 


5040 


TAAAATCCCT 


TTAATCGGCC 


TCCTGTTTAG 


CTCCCGCTCT 


GATTCTAACG 


AGGAAAGCAC 


5100 


GTTATACGTG 


CTCGTCAAAG 


CAACCATAGT 


ACGCGCCCTG 


TAGCGGCX;CA 


TTAAGCGCGG 


5160 


CGGGTGTGGT 


GGTTACGCXSC 


AGCGTGACCG 


CTACACTTGC 


CAGCGCCCTA 


GCGCCCGCTC 


5220 


CTTTCGCTTT 


CTTCCCTTCC 


TTTCT06CCA 


CGTTCGCCGG 


CTTTCCCCGT 


CAAGCTCTAA 


5280 


ATCGGGGGCT


CCCTTTAGGG 


TTCCGATTTA 


GTGCTTTACG


GCACCTCGAC 


CCCAAAAAAC 


5340 


TTGATTAGGG 


TGATGGTTCA 


CGTAGTGGGC 


CATCGCCCTG 


ATAGA0G6TT 


TTTCGCCCTT 


5400 


TGACGTTGGA 


GTCCACGTTC 


TTTAATAGTG 


GAGTCTTGTT 


CCAAACTGGA 


ACAACACTCA 


5460 


ACCCTATCTC 


GGTCTATTCT 


TTTGATTTAT


AAGGGATTTT 


GCCGATTTCG 


GCCTATTGGT 


5520 


TAAAAAATGA 


GCTGATTTAA 


CAAAAATTTA 


ACGCGAATTT 


TAACAAAATA 


TTAACGTTTA 


5580 


CAATTTAAAT 


ATTTGCTTAT 


ACAATCTTCC 


TGTTTTTGGG 


GCTTTTCTGA 


TTATCAACCG 


5640 


GGGTACATAT 


GATTGACATG 


CTAGTTTTAC 


GATTACCGTT 


CATCGATTCT 


CTTGTTTGCT 


5700 


CCAGACTCTC 


AGGCAATGAC 


CTGATAGCCT 


TTGTAGAGAC 


CTCTCAAAAA 


TAGCTACCCT 


5760 


CTCCGGCATG 


AATTTATCAG 


CTAGAACGGT 


TGAATATCAT 


ATTGATGGTG 


ATTTGACTGT 


5820 


CTCCGGCCTT 


TCTCACCCGT 


TTGAATCTTT 


ACCTACACAT 


TACTCAGGCA 


TTGCATTTAA 


5880 
AATATATGAG


GGTTCTAAAA 


ATTTTTATCC 


TTGCGTTGAA 


ATAAAGGCTT 


CTCCCGCAAA 


5940 


AGTATTACAG


GGTCATAATG 


TTTTTGGTAC


AACXXvATTTA 


GCTTTATGCT 


CTGAGGCTTT 


6000 


ATTGCTTAAT 


TTTGCTAATT 


CTTTGCCTTG 


CCTGTATGAT 


TTATTGGATG 


TTGGAANTTC 


6060 


CTGATGCX3GT 


ATTTTCTCCT 


TACGCATCTG 


TGCGGTATTT 


CACAC06CAT 


ATGGTGCACT 


6120 


CTCAGTACAA 


TCTGCTCTGA TGCCGCATAG


TTAAGCCACC 


CC06ACACCC 


GCCAACACCC 


6180 


GCTGACGCGC 


OCTGACGGGC 


TTGTCTGCTC 


COGGCATCOG 


CTTACAGACA


AGCTGTGACC


6240 


GTCTCCGGGA 


GCTGCATGTG 


TCAGAGGTTT 


TCACOGTCAT 


CACCGAAACG 


CGCGAGAOGA 


6300 


AAGGGCCTCG 


TGATACGCCT


ATTXTTATAG 


GTTAATGTCA 


TGATAATAAT 


GGTTTCTTAG 


6360 


ACGTGAGGTG 


GCACTTTTCG 


GGGAAATGTG 


CGCGGAACCC 


CTATTTGTTT 


ATTTTTCTAA 


6420 


ATACATTCAA 


ATATGTATCC 


GCTCATGAGA 


CAATAACCCT 


GATAAATGCT 


TCAATAATAT 


6480 


TGAAAAAGGA 


AGAGTATGAG 


TATTCAACAT 


TTCCGTGTCX3 


CCCTTATTCC 


CTTTTTTGCG 


6540 


GCATTTTGCC 


TTCCTGTTTT 


TGCTCACCCA 


GAAACGCTGG 


TGAAAGTAAA 


AGATGCTGAA 


6600 


GATCAGTTGG 


GTGCACGAGT 


GGGTTACATC 


GAACTGGATC 


TCAACAGCGG 


TAAGATCCTT 


6660 


GAGAGTTTTC 


GCCCCGAAGA ACGTTTTCCA 


ATGATGAGCA 


CTTTTAAAGT 


TCTGCTATGT 


6720 


GGCGOGGTAT 


TATCCCGTAT 


TGACGCOGGG 


CAAGAGCAAC TCGGTCGCCG 


CATACACTAT 


6780 


TCTCAGAATG 


ACTTGGTTGA 


GTACTCACCA 


GTCACAGAAA 


AGCATCTTAC 


GGATGGCATG 


6840 


ACAGTAAGAG 


AATTATGCAG 


TGCTGCCATA 


ACCATGAGTG 


ATAACACTGC 


GGCCAACTTA 


6900 


CTTCTGACAA 


CGATCGGAGG


ACCX^AAGGAG 


CTAACCGCTT 


TTTTGCACAA 


CATGGGGGAT 


6960 


CATGTAACTC


GCCTTGATCG 


TTGGGAACCG 


GAGCTGAATG 


AAGCCATACC 


aaaogacx;ag 


7020 


CGTGACACCA 


CGATGCCTGT 


AGCAATGGCA 


ACAACGTTGC GCAAACTATT 


AACTGGCGAA 


7080 


CTACTTACTC 


TAGCTTCCCG 


GCAACAATTA 


ATAGACTGGA 


TGGAGGCGGA 


TAAAGTTGCA


7140 


GGACCACTTC


TGCGCTCGGC 


CCTTCCGGCT 


GGCTGGTTTA 


TTGCTGATAA 


ATCTGGAGCC 


7200 


GGTGAGCGTG


GGTCTCGCGG 


TATCATTGCA 


GCACTGGGGC 


CAGATGGTAA 


GCCCTCCCGT 


7260 


ATCGTAGTTA 


TCTACACGAC


GGGGAGTCAG 


GCAACTATGG 


ATGAACGAAA 


TAGACAGATC 


7320 


GCTGAGATAG 


GTGCCTCACT


GATTAAGCAT 


TGGTAACTGT 


CAGACCAAGT 


TTACTCATAT 


7380 


ATACTTTAGA 


TTGATTTAAA


ACTTCATTTT 


TAATTTAAAA GGATCTAGGT 


GAAGATCCTT 


7440 


TTTGATAATC 


TCATGACCAA 


AATCCCTTAA 


CGTGAGTTTT 


CGTTCCACTG 


A6CGXCAGAC 


7500 


CCCGTAGAAA


AGATCAAAGG 


ATCTTCTTGA 


GATCCTTTTT TTCTGCGCGT 


AATCTGCTGC 


7560 


TTGCAAACAA 


AAAAACCACC 


GCTACCAGCG 


GTGGTTTGTT 


TGCCGGATCA


AGAGCTACCA 


7620 


ACTCTTTTTC 


CGAAGGTAAC 


TGGCTTCAGC 


AGAGCGCAGA 


TACCAAATAC 


TGTCCTTCTA 


7680 


GTGTAGCCGT 


AGTTAGGCCA 


CCACTTCAAG 


AACTCTGTAG 


CACCGCCTAC 


ATACCTCGCT 


7740 


CTGCTAATCC 


TGTTACCAGT 


GGCTGCTGCC 


AGTGGCGATA 


AGTCGTGTCT 


TACCGGGTTG 


7800 
GACTCAAGAC GATAGTTACC GGATAAGGCG CAGCGGTCGG GCTGAACGGG GGGTTCGTGC 7860

ACACAGCCCA GCTTGGAGCG AACGACCTAC ACCGAACTGA GATACCTACA GCGTGAGCTA 7920

TGAGAAAGCG CGACGCTTCC CGAAGGGAGA AAGGCGGAGA GGTATCCGGT AAGCGGCAGG 7980

GTCGGAACAG GAGAGCGGAC GAGGGAGCTT CCAGGGGGAA ACGCCTGGTA TCTTTATAGT 8040

CCTGTCGGGT TTCGCCACCT CTGACTTGAG CGTCGATTTT TGTGATGCTC GTCAGGGGGG 8100

CGGAGCCTAT GGAAAAACGC CAGCAACGCG GCCTTTTTAC GGTTCCTGGC CTTTTGCTGG 8160

CCTTTTGCTC ACATCTTCTT TCCTGCGTTA TCCCCTGATT CTGTGGATAA CCGTATTACC 8220

GCCTTTGAGT GAGCTGATAC CGCTCGCCGC AGCCGAACGA CCGAGCGCAG CGAGTCAGTG 8280

AGCGAGGAAG CGGAAGAGC 8299

(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 26 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
GCAGGTACCG CGAGTCAGAT CTACAC 26 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:
CTGTCTGAGC TAGAGCTC 18
WHAT IS CLAIMED IS:

1. A vector comprising a chimpanzee
adenovirus DNA sequence and a selected heterologous gene 
operatively linked to regulatory sequences which direct 
expression of said gene in a heterologous host cell. 

2. The vector according to claim 1 wherein 
said chimpanzee adenovirus sequence comprises at least 5'
and 3' cis-elements necessary for replication and virion
encapsidation, said cis-elements flanking said selected 
gene and regulatory sequences. 

3. The vector according to claim 1 wherein 
said chimpanzee adenovirus sequence has a deletion in all 
or a part of the E1 gene.

4. The vector according to claim 1 wherein 
said chimpanzee adenovirus sequence comprises the 
sequence of SEQ ID NO: 1 or a fragment thereof. 

5. The vector according to claim 1 wherein 
said chimpanzee adenovirus sequence comprises the 
sequence of SEQ ID NO: 2 or a fragment thereof. 

6. A host cell transfected with the vector of
claim 1.

7. A human cell that expresses a selected gene 
introduced therein through transduction of the vector of 
claim 1. 

8. A non-simian mammalian cell line that 
expresses a chimpanzee adenovirus gene. 
9. The cell line according to claim 8 wherein
said gene is an adenovirus E1 gene or a functional
fragment of said E1 gene.

10. The cell line according to claim 8 
wherein said chimpanzee adenovirus gene is obtained from 
the sequence of SEQ ID NO: 1. 

11. The cell line according to claim 8 wherein 
said chimpanzee adenovirus gene is obtained from the 
sequence of SEQ ID NO: 2. 

12. A pharmaceutical composition comprising a 
recombinant adenovirus vector in a pharmaceutically 
acceptable carrier, said vector comprising a chimpanzee 
adenovirus DNA sequence and a selected heterologous gene 
operatively linked to regulatory sequences which direct 
expression of said gene in a host cell. 

13. A method for delivering a heterologous 
gene to a mammalian cell comprising introducing into said 
cell an effective amount of the vector of claim 1. 

14. A method for producing a selected gene 
product comprising infecting a mammalian cell with the 
vector of claim 1, culturing said cell under suitable 
conditions and isolating and recovering from said cell 
culture the expressed gene product. 
15. The use of a vector comprising a
chimpanzee adenovirus DNA sequence and a selected
heterologous gene encoding an antigen of an infective 
agent operatively linked to regulatory sequences which 
direct expression of said gene in the production of a 
medicament for eliciting an immune response in a 
mammalian host against said infective agent. 

16. The use of a vector comprising a 
chimpanzee adenovirus DNA sequence and a selected 
heterologous therapeutic gene operatively linked to 
regulatory sequences which direct expression of said gene 
in the production of a medicament for treating a patient 
having an acquired or inherited genetic defect. 